whopub
11-30-2009, 05:57 AM
Hi,
I'm looking for an app, or online form, to extract image URLs from HTML code saved on TXT files. To be taken from <IMG SRC> tags, to be more exact.
I have several code snippets like this:
<img src="http://dummy.site.com/here/images/09/10065/file01.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file02.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file03.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file04.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file05.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file06.jpg" width="64" height="100" alt="image title" />
image name
And I need an automated way to extract just the URLs, and save them on a TXT file like this:
http://dummy.site.com/here/images/09/10065/file01.jpg
http://dummy.site.com/here/images/09/10065/file02.jpg
http://dummy.site.com/here/images/09/10065/file03.jpg
http://dummy.site.com/here/images/09/10065/file04.jpg
http://dummy.site.com/here/images/09/10065/file05.jpg
http://dummy.site.com/here/images/09/10065/file06.jpg
One URL per line.
The code snippets are not too big, just a bit over 100 entries for the bigger ones. I don't care if I have to do it one TXT at a time. Beats doing the whole thing by hand.
This is the sort of thing that makes me mad for not being a programmer! Any one of you guys could probably come up with a number of ways to pull this off in just a couple of minutes.
And I'm quite sure the tools to pull it off are already out there, but trying a search for it... well, let's just say there's way too much out there, and installing small random apps is really not safe.
I may be completely wrong, but I think I was able to feed code like this to flashget, and he'd just go through the whole thing and listed the actual URLs it found on a confirmation box, allowing me then to select just a few and copy them to the clipboard, in the exact same one-URL-per-line format I need here. But somehow my flashget installation got screwed and now I can't figure out what version I was using. Already tested 4 different ones and none of them seems to be able to do that.
I need those URLs in that format so I can then batch replace URL segments and, finally, feed the updated URLs to flashget. But the first step is extracting the initial URL from that code.
So, any ideas?
Thanks.
PS: hope I'm not screwing up but posting this here, but I really couldn't find a better match... And it IS HTML related, I guess.
I'm looking for an app, or online form, to extract image URLs from HTML code saved on TXT files. To be taken from <IMG SRC> tags, to be more exact.
I have several code snippets like this:
<img src="http://dummy.site.com/here/images/09/10065/file01.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file02.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file03.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file04.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file05.jpg" width="64" height="100" alt="image title" />
image name
<img src="http://dummy.site.com/here/images/09/10065/file06.jpg" width="64" height="100" alt="image title" />
image name
And I need an automated way to extract just the URLs, and save them on a TXT file like this:
http://dummy.site.com/here/images/09/10065/file01.jpg
http://dummy.site.com/here/images/09/10065/file02.jpg
http://dummy.site.com/here/images/09/10065/file03.jpg
http://dummy.site.com/here/images/09/10065/file04.jpg
http://dummy.site.com/here/images/09/10065/file05.jpg
http://dummy.site.com/here/images/09/10065/file06.jpg
One URL per line.
The code snippets are not too big, just a bit over 100 entries for the bigger ones. I don't care if I have to do it one TXT at a time. Beats doing the whole thing by hand.
This is the sort of thing that makes me mad for not being a programmer! Any one of you guys could probably come up with a number of ways to pull this off in just a couple of minutes.
And I'm quite sure the tools to pull it off are already out there, but trying a search for it... well, let's just say there's way too much out there, and installing small random apps is really not safe.
I may be completely wrong, but I think I was able to feed code like this to flashget, and he'd just go through the whole thing and listed the actual URLs it found on a confirmation box, allowing me then to select just a few and copy them to the clipboard, in the exact same one-URL-per-line format I need here. But somehow my flashget installation got screwed and now I can't figure out what version I was using. Already tested 4 different ones and none of them seems to be able to do that.
I need those URLs in that format so I can then batch replace URL segments and, finally, feed the updated URLs to flashget. But the first step is extracting the initial URL from that code.
So, any ideas?
Thanks.
PS: hope I'm not screwing up but posting this here, but I really couldn't find a better match... And it IS HTML related, I guess.