[Xerte] how to strip out a specific string?

Ron Mitchell ronm at mitchellmedia.co.uk
Sun Apr 12 20:19:55 BST 2009


Cheers Dave that's very helpful - I'll give that a try.

On 12 Apr 2009, at 19:35, Dave Burnett wrote:

>
> It actually does have a closing tag: />
>
> XML shorthands the closing tag to save characters when all the info  
> is contained in the attributes, i.e. there is no actual value  
> between the tags.
>
> <name>jamesclay (James Clay)</name> //no attributes, data between tags
>
>
> <link type="text/html" rel="alternate" href="http://twitter.com/jamesclay/statuses/1483252089 
> "/>
>
> //all data in opening tag attributes, no explicit closing tag, just />
>
>
> So if you know the format you show below would be consistent, you  
> could key on that "/>" as the end of the string and RFind (reverse  
> find) back to the first "/" you hit.
>
> That should isolate the actual picture name/format.
>
> Dave
>
>
> From: ronm at mitchellmedia.co.uk
> To: xerte at lists.nottingham.ac.uk
> Date: Sun, 12 Apr 2009 19:22:22 +0100
> Subject: [Xerte] how to strip out a specific string?
>
> Hi all
> I've been experimenting with the new module method of creating  
> templates and have created a couple of new templates optimised for a  
> specific purpose which I’ll happily share with the community once  
> complete. One is an optimised  view of twitter feeds but unless I’m  
> mistaken it seems like the xml isn't complete for some reason. I can  
> pick up and display the text and links etc ok but I can't work out  
> how to access the profile pics. Here's an example entry...
>
> <entry>
>     <id>tag:search.twitter.com,2005:1483252089</id>
>     <published>2009-04-09T12:59:26Z</published>
>     <link type="text/html" rel="alternate" href="http://twitter.com/jamesclay/statuses/1483252089 
> "/>
>     <title>@ronm123 I meant in a browser, via a second projector  
> probably.</title>
>     <content type="html">&lt;a href="http://twitter.com/ 
> ronm123"&gt;@ronm123&lt;/a&gt; I meant in a browser, via a second  
> projector probably.</content>
>     <updated>2009-04-09T12:59:26Z</updated>
>     <link type="image/png" rel="image" href="http://s3.amazonaws.com/twitter_production/profile_images/96243182/Photo14_normal.jpg 
> "/>
>     <twitter:source>&lt;a href="http://twitter.com/"&gt;web&lt;/ 
> a&gt;</twitter:source>
>     <author>
>       <name>jamesclay (James Clay)</name>
>       <uri>http://twitter.com/jamesclay</uri>
>     </author>
>   </entry>
>
> I want to be able to strip out the path to the .jpg e.g.http://s3.amazonaws.com/twitter_production/profile_images/96243182/Photo14_normal.jpg
>
> But can't work out how to do that. I can access the content of  
> <id>,<title> and even <name> but the tag containing the profile pic  
> link e.g. <link type=”image/png” doesn’t have a closing tag.  
> Presumably I need some kind of function + regular expression - any  
> suggestions?
>
> Cheers
> Ron
>
> Rediscover Hotmail®: Get e-mail storage that grows with you. Check  
> it out._______________________________________________
> Xerte mailing list
> Xerte at lists.nottingham.ac.uk
> http://lists.nottingham.ac.uk/mailman/listinfo/xerte

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.nottingham.ac.uk/pipermail/xerte/attachments/20090412/285598d4/attachment.html


More information about the Xerte mailing list