The ClueWeb12 Dataset:
Online Services
The Lemur Project provides several online services to simplify use of the ClueWeb12 dataset.
- Batch Query Service - Full Dataset: Use the Indri search engine to search the ClueWeb12 dataset.
- Interactive Search - Full Dataset: Use the Indri search engine to interactively search the ClueWeb12 dataset.
- Interactive Search - B13 Subset: Use the Indri search engine to interactively search the ClueWeb12 B13 dataset.
- Page Rendering: Render selected ClueWeb12 web pages (text + images).
- Attribute Lookup Service: Fast lookup of ClueWeb12 document attributes.
Some of these services require a username and password. If your organization has a license to use the ClueWeb12 dataset, you can obtain a username and password by contacting Jamie Callan.
Our computational resources are limited, so programs that access these services must abide by our usage policy.