Hello and welcome to our community! Is this your first visit?
Register
Enjoy an ad free experience by logging in. Not a member yet? Register.
Results 1 to 5 of 5
  1. #1
    New Coder
    Join Date
    Apr 2012
    Posts
    85
    Thanks
    7
    Thanked 0 Times in 0 Posts

    What to use for Web scraping?

    I need to scrape a Chinese website. The api looks dodgy, my virus checker won't let me go to the website as it says its flagged as unsafe.

    I searched on Google and found one article that comes 1st or 2nd in results.

    At the top of the list is Goutte.
    Great I thought. I'll use that.
    I can't find any YouTube videos on it.
    Not a single one (there might be some foreign language ones + I only looked on the first page of results).

    Hmm odd. But then I looked at the 2nd and 3rd and 4th suggested code. I can't seem to find YouTube videos.

    YouTube videos seem to use curl?

    Just want a few recommendations.

    Thanks.

  2. #2
    Master Coder Dormilich's Avatar
    Join Date
    Jan 2010
    Location
    Behind the Wall
    Posts
    5,842
    Thanks
    26
    Thanked 609 Times in 602 Posts
    Quote Originally Posted by OM2 View Post
    I can't seem to find YouTube videos.
    I wouldn't recommend YouTube as primary documentation source... Instead check out the project itself first: https://github.com/FriendsOfPhp/Goutte
    The computer is always right. The computer is always right. The computer is always right. Take it from someone who has programmed for over ten years: not once has the computational mechanism of the machine malfunctioned.
    André Behrens, NY Times Software Developer

  3. #3
    New Coder
    Join Date
    Apr 2012
    Posts
    85
    Thanks
    7
    Thanked 0 Times in 0 Posts
    @Dormilich youtube is great i think for seeing someone walk through.
    Would you recommend yourself using Goutte?
    This was all I was after - just a solid recommendation of what to use.
    Let me know.
    Thanks.

  4. #4
    Master Coder Dormilich's Avatar
    Join Date
    Jan 2010
    Location
    Behind the Wall
    Posts
    5,842
    Thanks
    26
    Thanked 609 Times in 602 Posts
    I never needed web scraping, so I can't tell.
    The computer is always right. The computer is always right. The computer is always right. Take it from someone who has programmed for over ten years: not once has the computational mechanism of the machine malfunctioned.
    André Behrens, NY Times Software Developer

  5. #5
    Supreme Master coder!
    Join Date
    Jun 2003
    Location
    Cottage Grove, Minnesota
    Posts
    10,386
    Thanks
    10
    Thanked 1,191 Times in 1,181 Posts
    Can you describe what you need to scrape and even post a link to it?
    I realize you think it might not be "safe", but is it a website that is Rated-G?

    The mention of CURL is a PHP method of accessing API's. Do you have to log into the site and use the API?

    If I knew (and could see) the information you are trying to scrape, I might have a better answer (or not).


 

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •