This collection of datasets provides markup of learning resources, expressed in the LRMI vocabulary, extracted from the Common Crawl of three consecutive years (2013-2015). Each yearly crawl covers between 1.7 and 2.2 billion Web documents. The extracted LRMI markup is provided as quads and comprises up to 70 million statements per year (the figure for 2015). Further details can be found here:

Field         Value
Author        LRMI team
Maintainer    LRMI team
Last Updated  March 16, 2017, 11:37
Created       March 16, 2017, 11:14
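
To make the quad format mentioned in the description more concrete, here is a minimal sketch of how such statements might be read and summarized. It assumes the quads are serialized as N-Quads, one `<subject> <predicate> <object> <graph> .` statement per line, which the description does not confirm. The helper names `parse_quad` and `count_predicates` and the sample lines are hypothetical and are not taken from the dataset. The regular expression covers only common cases and is not a full N-Quads parser.

```python
import re
from collections import Counter

# Assumption: one N-Quads statement per line, with the fourth term (graph label)
# naming the Web page the markup was extracted from.
QUAD_RE = re.compile(
    r'^\s*(<[^>]*>|_:\S+)\s+'   # subject: IRI or blank node
    r'(<[^>]*>)\s+'             # predicate: IRI
    r'(<[^>]*>|_:\S+|"(?:[^"\\]|\\.)*"(?:@[A-Za-z0-9-]+|\^\^<[^>]*>)?)\s+'  # object
    r'(<[^>]*>)\s*\.\s*$'       # graph label: IRI
)


def parse_quad(line):
    """Return (subject, predicate, object, graph) or None if the line does not match."""
    match = QUAD_RE.match(line)
    return match.groups() if match else None


def count_predicates(lines):
    """Count how often each predicate occurs, skipping lines that do not parse."""
    counts = Counter()
    for line in lines:
        quad = parse_quad(line)
        if quad is not None:
            counts[quad[1]] += 1
    return counts


# Invented sample statements for illustration only (not real dataset content).
SAMPLE = [
    '_:n1 <http://schema.org/educationalUse> "homework" <http://example.org/page1> .',
    '_:n1 <http://schema.org/typicalAgeRange> "7-9" <http://example.org/page1> .',
    '_:n2 <http://schema.org/educationalUse> "assessment"@en <http://example.org/page2> .',
    'not a quad',
]

predicate_counts = count_predicates(SAMPLE)
```

For files containing tens of millions of statements, the same function can be fed a file object directly so that lines are processed one at a time instead of being loaded into memory at once.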