Need to locate all PDF files on web site, is there a tool?

Is there a freeware/shareware utility available that will scan a web site and give URLs for document files such as PDFs and DOCs? Need to do an audit of files that have been uploaded to our intranet.

Unfortunately when I search I keep getting anti-virus scanning tools in the results, which is not what I'm looking for. :(

Any help would be greatly appreciated.



Hmm, this is what I got when I searched for “website file indexing software” (http://www.google.com/search?q=website+file+indexing+software). I’m sure there is something among these results that suits your needs. You just need to know which keywords to use.

Do you happen to be on a Linux server and have access to the command line? This won't help if you need an HTTP-only solution, but it will find the files on disk.

ls -ARl --group-directories-first /path/to/site/root/ | grep -i '\.pdf$'

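The drawback is that it doesn't give a path, but if it's a Linux box, you can run locate filename.pdf to get the location. Keep in mind that locate reads from an index that updatedb refreshes periodically, so very recent uploads may not show up until the index is rebuilt.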
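
If you want full paths in one step, find may be a better fit than ls piped through grep. Here is a rough sketch, assuming GNU find (the usual one on a Linux box). The function name list_docs is just something I made up for illustration, and the extension list (.pdf, .doc, .docx) is a guess at what the audit needs:

```shell
#!/bin/sh
# list_docs is a hypothetical helper name; pass it the site's document root.
# With no argument it searches the current directory.
list_docs() {
    root=${1:-.}
    # -type f  : regular files only (skips directories)
    # -iname   : case-insensitive match on the file name (GNU/BSD find)
    find "$root" -type f \( -iname '*.pdf' -o -iname '*.doc' -o -iname '*.docx' \) | sort
}

# Example call (the path is the same placeholder as above):
# list_docs /path/to/site/root/
```

The output is one file path per line, starting with whatever root you passed in. Those paths only translate directly to URLs if the web server serves files straight out of that directory, so treat that mapping as something to check for your setup.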