Home > Archive > General Discussion > October 2002 > Search Engine ?





You are viewing an archived Text-only version of the thread. To view this thread in it's original format and/or if you want to reply to this thread please [click here]

Author Search Engine ?
kappagamma698

2002-10-25, 4:18 pm

Hey, have a question I have been asked to be on a Six Sigma project for the company and part of the project is going to include creating a search engine and I was wondering what language is the best to use for this ASP, JAVA or another. I would like to know so I can try to start to learn it or brush up on the language. Any help would help and any links would also I already have a dozen or so but need many more to figure the best route.
Thanks in advance
fmusick

2002-10-30, 11:53 pm

What server?

If you're on a Windows server a search engine will take you about an hour (full text, keywords, whatever you want) just use a FileSystemObject and open all the html files as text. Run a Regular Expression match on them and Response.Write the results page. There are other ways to do it including a database of keywords and corresponding pages and then query the database, this is a lot more scalable but requires more input up front. As for a full text search in a database, I think you could load the entire page as a field but other than that you got me.

As for Java, it can be done similar ways. The Response.Write would be replaced by document.write. My Java's a little rusty and so I can't think of the corresponding twin to FileSystemObject.
kappagamma698

2002-10-31, 9:18 am

Thanks for the info, are there any good books on how to create a search engine? I have not been told what server they are going to place this on if it is going to be a unix server or a NT server, most of the websites are hosted on unix. But this is something that we will discuss to figure out the best way for costs of course. Any other help would be great thanks for the proir help
fmusick

2002-10-31, 6:26 pm

On the languages themselves, there's manuals, but to my knowledge no one has written one specifically for search engines. You may try some internet searches but you will have to sort through thousands of links. A specific search for "Build your own search engine" might be your best best.

As for which way to go NT or Unix, that debate has been raging for years. Do you have a webmaster yet or is one going to be hired? I'm a Windows guy and so I can't give an unbiased view. If you go with Unix, I would recommend using perl. I would bet someone has already written at least one module for search engines but it would be relatively easy if you don't find one that does what you want. Same basic premise, open text files do a compare, output results.

ASP (VBScript) would probably be the easiest to pickup, but ASP and Unix don't mix. Perl doesn't always get along with NT/2000 but loves Unix boxes. Java is cross platform. You just install the intepreter. I think it forces you to write way too much code though. (just a personal opinion, no offense to Java or Java programmers).
Sponsored Links





Free Braindumps | MCSE braindumps software forum

Copyright 2003 - 2008 examnotes.net