LRMI on the Web

This collection of datasets contains markup of learning resources that follows the LRMI (Learning Resource Metadata Initiative) vocabulary, extracted from the Common Crawl for three consecutive years (2013-2015). Each yearly crawl covers between 1.7 and 2.2 billion Web documents. The extracted LRMI markup is provided as quads and comprises up to 70 million statements per year (reached in 2015). Further details can be found in the accompanying paper: https://stefandietze.files.wordpress.com/2017/03/lrmi-www2017-cam-ready.pdf
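As a rough illustration of how the quads could be inspected, the sketch below counts statements per predicate. It assumes the dumps are serialized as N-Quads (one statement per line: subject, predicate, object, source page); this page does not state the serialization. The sample lines, the example.org URLs, and the helper name count_predicates are hypothetical, and the regular expression only handles simple lines, so a real RDF parser would be the safer choice for production use.

```python
import re
from collections import Counter

# Hypothetical sample, assuming N-Quads serialization:
# subject, predicate, object, graph (source page), terminated by " ."
SAMPLE = """\
<http://example.org/page1#r> <http://schema.org/learningResourceType> "lesson plan" <http://example.org/page1> .
<http://example.org/page1#r> <http://schema.org/typicalAgeRange> "7-9" <http://example.org/page1> .
<http://example.org/page2#r> <http://schema.org/learningResourceType> "quiz" <http://example.org/page2> .
"""

# Simplified pattern: IRI or blank-node subject, IRI predicate,
# arbitrary object, IRI graph. Not a full N-Quads grammar.
QUAD = re.compile(r'^(\S+)\s+<([^>]+)>\s+(.+?)\s+<([^>]+)>\s*\.\s*$')


def count_predicates(lines):
    """Count how many statements use each predicate IRI."""
    counts = Counter()
    for line in lines:
        match = QUAD.match(line.strip())
        if match:  # skip blank or malformed lines
            counts[match.group(2)] += 1
    return counts


counts = count_predicates(SAMPLE.splitlines())
for predicate, n in counts.most_common():
    print(n, predicate)
```

For a real dump, the same function could be fed an open file object line by line instead of the in-memory sample, which avoids loading the whole file.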

Data and Resources

Additional Info

Field         Value
Author        LRMI team
Maintainer    LRMI team
Last Updated  March 16, 2017, 11:37
Created       March 16, 2017, 11:14