08-02-2011, 10:09 AM | #1
mexabet
In What Way Is Python Suitable For Programming A Web Search Engine?

My new project is to build a web search engine with a web spider, and I'm thinking of using three languages: Python, Java and C++. I'm somewhat confused about which programming language is best suited to each component: the web crawler, content indexer, ranking algorithm and search mechanism.

I fully agree that some programming languages deliver optimal performance for certain tasks and lag behind in other areas, so I want to make the right choices. A friend of mine suggested that I use C++ for features that demand maximum speed and Python for glue code that is not time-critical. But I'm not yet sure exactly which features will require that speed, so I'd welcome your guidance.

Now my questions are:
  1. Where should Python come in? Which features should it be used for?
  2. Which language (C++ or Java) is most suitable for developing a web crawler and why?
  3. Which language is best suited for developing a search ranking algorithm - C++ or Java?
  4. Which features of the search engine should C++ be used for?
  5. Which features should Java be used for?
  6. Do these three languages make a good combination when developing a search application?
  7. Which database management system would suit this type of application? Would MySQL be reliable, or is there a more capable database system that would suit it better?

Please enlighten me on the points above so that I'm better equipped to get down to work. Any constructive response or suggestion will be highly appreciated.
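
To make the components named above concrete, here is a toy sketch in Python of the indexer, ranking and search pieces. It is only an illustration, not a recommended design: the function names (tokenize, build_index, search), the example URLs and the "sum of term counts" score are all assumptions made for this sketch, and the crawler is left out (the pages dictionary stands in for whatever a crawler would have fetched).

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Build an inverted index: term -> {url: number of occurrences}.

    `pages` maps a URL to its already-fetched text (hypothetical input
    that a crawler would normally produce).
    """
    index = defaultdict(dict)
    for url, text in pages.items():
        for term, count in Counter(tokenize(text)).items():
            index[term][url] = count
    return index


def search(index, query):
    """Return URLs ordered by a deliberately naive score.

    The score is the sum of occurrence counts of each query term;
    ties are broken alphabetically by URL so the order is stable.
    """
    scores = Counter()
    for term in tokenize(query):
        for url, count in index.get(term, {}).items():
            scores[url] += count
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [url for url, _ in ranked]
```

Prototyping these pieces in Python first is one way to find out which of them are actually time-critical before deciding what, if anything, to rewrite in C++ or Java.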

Last edited by mexabet; 08-02-2011 at 12:04 PM.
10-03-2011, 01:44 AM | #2
fatecaresx13
I would say that you'd use C++ (or Java; I prefer C, though) for the actual searching portion, maybe as a command-line interface: ./search [options] query_terms, or something that returns a reusable interface. I assume you're going to make the application usable from the web, so you could use something like PHP to call it.

Keep in mind that making direct calls can be dangerous because user input could be injected into the command, e.g. shell_exec("./search query; rm -rf /"), where the text after the semicolon runs as a second command. That is where you'd want to be really careful.

I love Python, but I feel like you don't necessarily need it in this case. Of course, there are infinitely many possibilities for this kind of thing.
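
If the caller were Python rather than PHP, one common way to avoid that kind of injection is to skip the shell entirely and pass the query as a single argument. The sketch below assumes a ./search executable like the one described above (it is hypothetical, as is the run_search helper); ECHO_STANDIN is just a stand-in command that prints its argument back so the behaviour can be tried without that binary.

```python
import subprocess
import sys

# Stand-in for the hypothetical ./search binary: prints its first argument.
ECHO_STANDIN = (sys.executable, "-c", "import sys; print(sys.argv[1])")


def run_search(query, argv_prefix=("./search",), timeout=5.0):
    """Run the search program with the query as ONE argument.

    Because the arguments are passed as a list and no shell is involved,
    characters such as ';' or '|' in the query are never interpreted as
    shell syntax; the program simply receives them as text.
    """
    argv = [*argv_prefix, query]
    result = subprocess.run(
        argv,
        capture_output=True,  # collect stdout/stderr instead of printing
        text=True,            # decode output to str
        timeout=timeout,      # do not hang forever on a stuck process
        check=True,           # raise CalledProcessError on non-zero exit
    )
    return result.stdout
```

For example, run_search("query; rm -rf /", argv_prefix=ECHO_STANDIN) hands the whole string to the stand-in as plain text, and nothing after the semicolon is executed. The search program still has to treat the query as untrusted input (for instance, a query that starts with a dash could be mistaken for an option).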


Anthony
10-03-2011, 05:12 AM | #3
mexabet
@fatecaresx13

Thanks for your input.

I'm not planning to make direct calls, and I prefer Python to PHP.