Data / RDFa / Microformat / Embedded RDF / Microdata / Schema.org / HReview / Resource Description Framework / HCard / Semantic Web / World Wide Web / Information


Web Data Commons – Providing Structured Data from 1.5 Billion Web Pages

Document Date: 2012-10-24 09:30:23


File Size: 132,18 KB

City

Beijing / Lyon / Campinas / /

Company

Amazon / SignalGroup / Google / Yahoo! / Microsoft / Facebook / /

Country

France / China / /

Currency

EUR / /

IndustryTerm

web documents / web corpus / large web corpora / Web using Microformats / fewer web pages / web index / individual web pages / instance products / Web corpora / archived web page / entity-oriented search / web / frequent / web crawls / web resources / Web Corpora Hannes Mühleisen Christian Bizer Web-based Systems Group Freie Universität Berlin Germany Web-based Systems / media files / web authors / analyzed web corpus / e-commerce data / Web Data Commons project / web pages using crawler software / search results / up-to-date web corpus / /

Organization

Common Crawl / /

Person

Hannes Mühleisen / Christian Bizer / /

Position

author / RDF extractor / mayor / Any23 extractor / /

ProgrammingLanguage

XPath / HTML / /

Technology

http / Resource Description Framework / DOM / RDF / HTML / /

URL

http /

SocialTag