Back to Results
First PageMeta Content
Reputation management / Digital media / Spamming / World Wide Web / Spamdexing / PageRank / Spam / Google Search / TrustRank / Internet / Computing / Link analysis


Soft-404 Pages, A Crawling Problem Víctor M. Prieto, Manuel Álvarez, Fidel Cacheda Comunications and Information Technologies Department Facultade de Informática Universidade da Coruña (University of A Coruna)
Add to Reading List

Document Date: 2014-05-16 22:36:41


Open Document

File Size: 329,29 KB

Share Result on Facebook

Company

Google / Soft / /

/

Event

Product Recall / Product Issues / /

Facility

University of A Coruna / /

IndustryTerm

web content analysis / classification algorithm / Web Decay / Web Spam detection / dataset containing web pages / crawler systems / web content / random web page / web servers / web server returns / Web Volume / web browsers / singlenode search system / search engines / Web Spam / web page content / web page existing / data mining / Web Server/Server / root web pages / classification algorithms / appropriate algorithm / web server / neuronal networks / main Web Spam / search engine / multi-site web search architecture / normal processing / /

MarketIndex

set 680 / /

Organization

Universidade da Coruña / University of A Coruna / European Union / /

Person

Víctor M. Prieto / Manuel Álvarez / Fidel Cacheda / /

Position

Official / General / /

Product

Recall Precision / /

Technology

4 Stanford Digital Libraries Technologies / HTTP protocol / DNS / classification algorithms / HTTP / Data Mining / caching / search engine / machine learning / C4.5 algorithm / HTML / C4.5 classification algorithm / web server / /

URL

http /

SocialTag