text-based algorithm / content management systems / web content / randomized algorithms / template detection algorithms / personal web site / important and growing tools / large-scale web crawler / web changes / motivational algorithms / media sites / web browsers / web site design software / web pages using features / search engines / web designers / web mining / data mining / Internet Archive / average media / local media outlets / web data / Text-based algorithms / /
Organization
Web Page Templates David Gibson Kunal Punera Andrew Tomkins IBM Almaden Research Center / University of Texas at Austin / International World Wide Web Conference Committee / Electrical and Computer Engineering University / /
Person
David Gibson Kunal Punera Andrew / Andrew Tomkins / / /
Position
browsing assistant / Text processing General / /
ProgrammingLanguage
HTML / /
ProvinceOrState
Texas / California / /
Technology
ALGORITHMS The algorithms / content management / DOM-based algorithms / text-based algorithm / 3.2 Text-based algorithm / data mining / analysis algorithms / caching / DOM-based algorithm / DOM / template detection algorithms / machine learning / 3.1 DOM-based algorithm This algorithm / HTML / Text-based algorithms / two algorithms / /