Resources

Online Tools 

Word Stemming http://www.tartarus.org/~martin/PorterStemmer/index.html
HTML Validation http://validator.w3.org/
Near Dup Content Detection http://www.copyscape.com/
 Research Papers: 

Link Spam Detection Based
on Mass Estimation
Hilltop: A Search Engine based
on Expert Documents
http://www.cs.toronto.edu/pub/reports/csrg/405/hilltop.html
SemanticWeb.org http://www.semanticweb.org/
WWW9 http://www9.org/w9cdrom/

News Letters:

Search Engine Watch http://www.searchenginewatch.com/
Planet Ocean http://www.searchenginenews.com/
Pandia Post http://www.pandia.com/post/index.html

Blogs:

Matt Cutts (Google) http://www.mattcutts.com/blog/
Yahoo Blog http://www.ysearchblog.com/
MSN Blog http://blogs.msdn.com/msnsearch/

 

Forums:

WebmasterWorld http://www.webmasterworld.com/
Search Engine Forums http://www.searchengineforums.com/
Search Engine Watch Forums http://forums.searchenginewatch.com/index.php
SitePoint Forums http://www.sitepoint.com/forums/

 

Patents: (A)Indicates Application Pending

All Google Patents Complete List of Google Patents
(A) Nov. 3, 2005 Profile based capture component
Sept. 6, 2005 Methods and apparatus for determining equivalent descriptions for an information need
August 23, 2005 Address geocoding
(A) July 7, 2005 Systems and methods for direct navigation to specific portion of target document
(A) July 7, 2005 Generating hyperlinks and anchor text in HTML and non-HTML documents
(A) July 7, 2005 Systems and methods for improving search quality
March 8, 2005 Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query
June 22, 2004 Techniques for finding related hyperlinked documents using link-based analysis
April 20, 2004 Ranking search results by reranking the results based on local inter-connectivity
Jan. 13, 2004 Information extraction from a database
Dec. 2, 2003 Detecting duplicate and near-duplicate files
Sept. 2, 2003 Detecting query-specific duplicate documents
March 4, 2003 Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query
Feb. 25, 2003 Ranking search results by reranking the results based on local inter-connectivity