--- QLog_DS ---

http://www.ccc.ipt.pt/~ricardo/datasets/QLog_DS.html

http://www.ccc.ipt.pt/~ricardo/datasets/QLog_DS.zip (for downloading data)


DATASET REFERENCE

This dataset may be used for any research purposes upon referring the following reference:

Campos, R., Jorge, A. and Dias, G. (2011). Using Web Snippets and Query-logs to Measure Implicit Temporal Intents in Queries. In Proceedings of the Query Representation and Understanding Workshop (QRU 2011) associated to 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR2011) Beijing, China, 28 July, pp 13 - 16.

 

SUMMARY

The QLog_DS is a dataset designed for evaluating the the correctness of dates retrieved by the Google and Yahoo auto-completion search engines feature.

It consists of 42 text queries selected from the 27 categories of Google Insights for Search 2010 and 2011 Webpage trends, after removing duplicates, atemporal queries and queries with multiple meanings.

Each query was issued on Google and Yahoo search engines, using the Google and Yahoo Query Log API .

In order to detect the largest number of dates, each query was run in three different ways. As an example consider the query true grit

Query: true grit

The ground truth was obtained by conducting two relevance human judgments, one for the Google Query Logs and another one for the Yahoo Query Logs.

Google Query Logs consists of 283 (q,d) pairs and Yahoo Query Logs consists of 298 (q,d) pairs, where q is the query and d the date.

Each (q, d) pair was assigned a relevance label on a 2-level scale:

The final list of Google judgments consists of 98 (q,d) pairs labeled with score 0, and 185 (q,d) with score 1.

The final list of Yahoo judgments consists of 105 (q,d) pairs labeled with score 0, and 193 (q,d) with score 1.

 

The QLog_DS dataset is an Excel file consisting of two spreadsheets described below:

 

OTHER REFERENCES

More details on this dataset can be found in the following paper:

Campos, R., Jorge, A. and Dias, G. (2011). Using Web Snippets and Query-logs to Measure Implicit Temporal Intents in Queries. In Proceedings of the Query Representation and Understanding Workshop (QRU 2011) associated to 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR2011) Beijing, China, 28 July, pp 13 - 16.

 

DOWNLOAD

http://www.ccc.ipt.pt/~ricardo/datasets/QLog_DS.zip

 

MORE INFO

If you have any further questions, please contact Ricardo Campos (ricardo.campos@ipt.pt).