WEX

The WEX dataset is no longer being generated. Please see the Wikimedia data dumps for up-to-date dumps of Wikipedia.

The Freebase Wikipedia Extraction (WEX) is a processed dump of the English-language Wikipedia. The wiki markup of each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted in tabular form. Freebase WEX is provided as a set of database tables in TSV format for PostgreSQL, along with tables that map Wikipedia articles to Freebase topics and to the corresponding Freebase Types.
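
As a rough illustration of working with tab-separated table files outside of PostgreSQL, the sketch below reads rows from a TSV stream. It assumes the files follow PostgreSQL's text COPY convention of writing NULL as `\N`; that assumption, the function name `parse_tsv_rows`, and the sample rows are all hypothetical and are not taken from the WEX documentation, so check WEX/Documentation for the actual table layouts.

```python
import io


def parse_tsv_rows(stream, null_marker="\\N"):
    """Yield each line of a TSV stream as a list of fields.

    Fields equal to null_marker are returned as None. The default marker
    is an assumption based on PostgreSQL's text COPY format.
    """
    for line in stream:
        fields = line.rstrip("\n").split("\t")
        yield [None if field == null_marker else field for field in fields]


# Hypothetical sample rows, not taken from an actual WEX file.
sample = io.StringIO("1\tExample article\t\\N\n2\tAnother article\tsome text\n")
rows = list(parse_tsv_rows(sample))
```

This sketch does not decode backslash escapes inside fields, so loading the files into PostgreSQL directly is likely the more reliable route for real use.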

Download

Freebase WEX is distributed, like Wikipedia itself, under the terms of version 1.2 of the GNU Free Documentation License or any later version published by the Free Software Foundation.

Download Freebase WEX

Documentation

See WEX/Documentation for complete documentation.

Citing

If you'd like to cite WEX in a publication, you may use:

Google, Freebase Wikipedia Extraction (WEX), http://download.freebase.com/wex/, <month> <day>, <year>

Or as BibTeX:

 
@misc{freebase:wex,
  title = "Freebase Wikipedia Extraction (WEX)",
  author = "Metaweb Technologies",
  howpublished = "\url{http://download.freebase.com/wex/}",
  edition = "<month> <day>, <year>",
  year = "<year>"
}

Related Work

  • DBpedia, "a community effort to extract structured information from Wikipedia and to make this information available on the Web", http://dbpedia.org/
  • Hugo Zaragoza, Jordi Atserias, Massimiliano Ciaramita and Giuseppe Attardi (Yahoo! Research Barcelona), Semantically Annotated Snapshot of the English Wikipedia, Yahoo! Barcelona, 2007.
