  1. #1
    New Coder
    Join Date
    Sep 2008
    Location
    Mexabet.biz
    Posts
    86
    Thanks
    12
    Thanked 0 Times in 0 Posts

    In What Way Is Python Suitable For Programming A Web Search Engine?

    My new project is to build a web search engine with a web spider, and I'm thinking of using three languages: Python, Java and C++. I'm somewhat confused about which language is best suited to each component: the web crawler, the content indexer, the ranking algorithm and the search mechanism.

    I fully agree that some programming languages deliver optimal performance for certain tasks and lag behind in other areas, so I want to make the right choices. A friend of mine suggested that I use C++ for the features that demand the most speed and Python for glue code that is not time-critical. But I'm not yet sure exactly which features will require that speed, so you may want to enlighten me.

    Now my questions are:
    1. Where should Python come in? Which features should it be used for?
    2. Which language (C++ or Java) is most suitable for developing a web crawler and why?
    3. Which language is best suited for developing a search ranking algorithm - C++ or Java?
    4. Which features of the search engine should C++ be used for?
    5. Which features should Java be used for?
    6. Do these three languages make a good combination when developing a search application?
    7. Which database management system would be a good fit for this type of application? Will MySQL be reliable, or is there a database system better suited to it?


    Please, enlighten me on the above-mentioned points, so that I'll be more equipped to get down to work. Any positive response and suggestion will be highly appreciated.
    Last edited by mexabet; 08-02-2011 at 12:04 PM.

  • #2
    New Coder
    Join Date
    Jan 2010
    Posts
    29
    Thanks
    0
    Thanked 2 Times in 2 Posts
    I would say that you'd use C++ (or Java, though I prefer C) for the actual searching portion, perhaps as a command-line program along the lines of ./search [options] query_terms that returns its results in a reusable form. I assume you want the application to be usable from the web, so you could use something like PHP to call it.

    Keep in mind that calling it directly through a shell can be dangerous, because a user's query could inject commands, e.g. shell_exec("./search query; rm -rf /"), so that is where you'd want to be really careful.
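
    As a rough illustration of avoiding that kind of injection, here is a minimal Python sketch that passes the query as a single argument in a list, so no shell ever parses it. The ./search binary is the hypothetical program from above, and the "--" separator assumes it follows the usual option-parsing convention; build_search_command and run_command are names made up for this example.

```python
import subprocess
import sys


def build_search_command(query, binary="./search"):
    # Hypothetical CLI from the post above. "--" is assumed to end option
    # parsing, so a query starting with "-" is not treated as a flag.
    return [binary, "--", query]


def run_command(argv):
    # An argument list with the default shell=False is executed directly;
    # characters such as ";" in the query are never interpreted by a shell.
    result = subprocess.run(argv, capture_output=True, text=True, check=True)
    return result.stdout


hostile = "query; rm -rf /"

# The real call would be run_command(build_search_command(hostile)).
# Since ./search does not exist here, a stand-in child process echoes its
# first argument to show the string arrives intact as one argument.
echo = [sys.executable, "-c", "import sys; print(sys.argv[1])", hostile]
echoed = run_command(echo).strip()
print(echoed)
```

    The same idea applies if PHP does the calling: escape the argument (for example with escapeshellarg) rather than pasting the raw query into the command string.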

    I love Python, but I feel like you don't necessarily need it in this case. Of course, there are many possible ways to approach this kind of thing.


    Anthony

  • #3
    New Coder
    Join Date
    Sep 2008
    Location
    Mexabet.biz
    Posts
    86
    Thanks
    12
    Thanked 0 Times in 0 Posts
    @fatecaresx13

    Thanks for your input.

    I'm not planning to make direct calls, and I prefer Python to PHP.

