Folgen
Marc Najork
Marc Najork
Google DeepMind
Bestätigte E-Mail-Adresse bei google.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Detecting spam web pages through content analysis
A Ntoulas, M Najork, M Manasse, D Fetterly
Proceedings of the 15th international conference on World Wide Web, 83-92, 2006
8952006
Mercator: A scalable, extensible web crawler
A Heydon, M Najork
World Wide Web 2 (4), 219-229, 1999
8671999
A large-scale study of the evolution of web pages
D Fetterly, M Manasse, M Najork, J Wiener
Proceedings of the 12th international conference on World Wide Web, 669-678, 2003
8342003
Breadth-first crawling yields high-quality pages
M Najork, JL Wiener
Proceedings of the 10th international conference on World Wide Web, 114-118, 2001
6332001
Web crawling
C Olston, M Najork
Foundations and Trends® in Information Retrieval 4 (3), 175-246, 2010
6132010
Spam, damn spam, and statistics: Using statistical analysis to locate spam web pages
D Fetterly, M Manasse, M Najork
Proceedings of the 7th International Workshop on the Web and Databases …, 2004
4732004
On near-uniform URL sampling
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 33 (1-6), 295-308, 2000
3432000
Position Bias Estimation for Unbiased Learning to Rank in Personal Search
X Wang, N Golbandi, M Bendersky, D Metzler, M Najork
11th ACM International Conference on Web Search and Data Mining, 2018
2802018
Boxwood: Abstractions as the Foundation for Storage Infrastructure.
J MacCormick, N Murphy, M Najork, CA Thekkath, L Zhou
OSDI 4, 8-8, 2004
2802004
Learning to rank with selection bias in personal search
X Wang, M Bendersky, D Metzler, M Najork
39th International ACM SIGIR Conference on Research and Development in …, 2016
2742016
Automatically Creating Training Data For Language Identifiers
M Goldszmit, M Najork, S Paparizos
US Patent App. 13/943,788, 2015
2312015
On the evolution of clusters of near-duplicate web pages
D Fetterly, M Manasse, M Najork
Proceeding of the 1st Latin American Web Congress, 37-45, 2003
2232003
High-performance web crawling
M Najork, A Heydon
Handbook of massive data sets, 25-45, 2002
2162002
WIT: Wikipedia-based image text dataset for multimodal multilingual machine learning
K Srinivasan, K Raman, J Chen, M Bendersky, M Najork
44th International ACM SIGIR Conference on Research and Development in …, 2021
2122021
SOCIAL NETWORK RECOMMENDED CONTENT AND RECOMMENDING MEMBERS FOR PERSONALIZED SEARCH RESULTS
T Harrington, R Shenoy, M Najork, R Panigrahy
US Patent App. 13/252,215, 2013
2102013
Measuring index quality using random walks on the Web
MR Henzinger, A Heydon, M Mitzenmacher, M Najork
Computer Networks 31 (11-16), 1291-1303, 1999
2081999
Detecting phrase-level duplication on the world wide web
D Fetterly, M Manasse, M Najork
Proceedings of the 28th annual international ACM SIGIR conference on …, 2005
1902005
System and method for associating an extensible set of data with documents downloaded by a web crawler
MA Najork, CA Heydon
US Patent 6,351,755, 2002
1832002
A sketch-based distance oracle for web-scale graphs
A Das Sarma, S Gollapudi, M Najork, R Panigrahy
Proceedings of the third ACM international conference on Web search and data …, 2010
1702010
Web crawler system using plurality of parallel priority level queues having distinct associated download priority levels for prioritizing document downloading and maintaining …
MA Najork, CA Heydon, JL Wiener
US Patent 6,263,364, 2001
1642001
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20