World Wide Web / Heritrix / Focused crawler / Web harvesting / Web archiving / Robots exclusion standard / Web search engine / Distributed web crawling / Information science / Web crawlers / Information retrieval


Document Date: 2013-09-23 08:37:31


File Size: 149,31 KB

City

Montreuil / Athens / Paris / /

Company

Avery / Twitter / Heritrix / CNN / /

Country

France / /

Facility

ATHENA Research Center / URL store / University of Hannover / /

IndustryTerm

online phase runs / Online analysis The prioritization module / web content / web applications / social media / open source archival quality web crawler / web forum / adaptive query processing / online analysis phase / online and offline analysis modules / Web Archiving / web page type / exhaustive web harvesting approaches / web application / web objects / distributed web crawler / Web Archiving Workshop / scheduled web page / selective web harvesting / Web Crawling / online phase / crawled web pages / social network / web service / developed web service / Web Harvesting / Web application type detection patterns / large-scale selective Web harvesting The workflow / online analysis modules / matched web application / Web Engineering / /

Movie

Paris / France 2 / Paris / France 2 4 / /

Organization

ATHENA Research Center / Greece Internet Memory Foundation / European Commission / Germany Institut Mines-Télécom / University of Hannover / /

Person

Julien Masanès / /

Position

link extractor / /

Technology

JSON / API / RDF / simulation / main technologies / /

SocialTag