Profile

Click to view full profile
Hi, I'm Veerapat Sriarunrungrueang, an expert in technology field, especially full stack web development and performance testing.This is my coding diary. I usually develop and keep code snippets or some tricks, and update to this diary when I have time. Nowadays, I've been giving counsel to many well-known firms in Thailand.
view more...

Tuesday, November 1, 2011

The Lemur Project - SEO Tools

Lemur is an opensource toolkit for Information Retrieval and language modeling, It also provides API in C++, JAVA, C#, etc. I used it to check ranking among my website pages and competitor pages.
To use this tool, you have to download it first from http://www.lemurproject.org/.
Then, to allow Lemur indexing your pages you need to put Lemur' headers for all of your indexed pages as the following code:

<doc>
<docno>FULL PATH TO THIS FILE<⁄docno>
<dochdr><⁄dochdr>
.
.
.
YOUR HTML CONTENT
.
.
.
<⁄doc>

Next, let's see urllist.txt, you need to put all of full path of your indexed files one line per one file path.
C:\seo\competitors\aol\1.html
C:\seo\competitors\bbc\1.html
C:\seo\competitors\bbc\2.html
.
.
.

After that, edit queries.txt, insert keywords with one keyword per line.
carbon dioxide
cfc
fuel
.
.
.

Assume that my Lemur' folder is named seo. Do as follows:
1. Open a command window.
2. Go to seo directory by type in the following commnad.
cd C:\seo
3. To index the web pages (both competitors and cfs), type this
C:\seo\lemur\bin\BuildIndex.exe buildparam.txt
4. To run a query, type this
C:\seo\lemur\bin\RunQueriesHTML.exe queryparam.txt
5. Open the result file located in C:\seo\results.html with you web browser

Finally, you will see the result similar to the following picture:










Enjoy using it xD.

1 comment: