WEX

From Freebase

Revision as of 05:05, 8 July 2010 by Viral (Talk | contribs)
Jump to: navigation, search

The Freebase Wikipedia Extraction (WEX) is a processed dump of the English language Wikipedia. The wiki markup for each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted in tabular form. Freebase WEX is provided as a set of database tables in TSV format for PostgreSQL, along with tables providing mappings between Wikipedia articles and Freebase topics, and corresponding Freebase Types.

Contents

Download

Freebase WEX is provided free of charge for any purpose with regular updates by Metaweb Technologies. It is distributed, like Wikipedia itself, under the terms of version 1.2 of the GNU Free Documentation License or any later version published by the Free Software Foundation. You can download Freebase WEX from here.

Documentation

See WEX/Documentation for complete documentation.

Citing

If you'd like to cite WEX in a publication, you may use:

Metaweb Technologies, Freebase Wikipedia Extraction (WEX), http://download.freebase.com/wex/, <month> <day>, <year>

Or as BibTeX:

 
@misc{metaweb:wex,
  title = "Freebase Wikipedia Extraction (WEX)",
  author = "Metaweb Technologies",
  howpublished = "\url{http://download.freebase.com/wex/}",
  edition = "<month> <day>, <year>",
  year = "<year>"
}

Related Work

  • DBpedia, "a community effort to extract structured information from Wikipedia and to make this information available on the Web", http://dbpedia.org/
  • Hugo Zaragoza, Jordi Atserias, Massimiliano Ciaramita and Giuseppe Attardi, (Yahoo! Research Barcelona), Semantically Annotated Snapshot of the English Wikipedia, http://www.yr-bcn.es/semanticWikipedia, 2007.

See also

Personal tools