TREC Resources
From ATIRE
Document counts and Topics used
Author's note: topics above 450 may not be accurately mapped. Please email if you have corrections.
| Conference | Collection | Document Count | Topics |
|---|---|---|---|
| TREC 1 | TREC 1 | 741,856 | 51-100 |
| TREC 2 | TREC 1 | 741,856 | 101-150 |
| TREC 3 | TREC 1 | 741,856 | 151-200 |
| TREC 4 | TREC 4 | 567,529 | 201-250 |
| TREC 5 | TREC 5 | 524,929 | 251-300 |
| TREC 6 | TREC 6 | 556,077 | 301-350 |
| TREC 7 | TREC 7 | 528,155 | 351-400 |
| TREC 8 | TREC 7 | 528,155 | 401-450 |
| WT2G | 247,491 | ||
| WT10G | 1,692,096 | 451-500 | |
| WT100G | 20,616,457 | ||
| DOTGOV | 1,247,753 | 551-600 | |
| DOTGOV2 | 25,205,179 | 701-800 |
Resource links
Queries and Topics are hosted externally.
For queries, navigate to http://trec.nist.gov/data/qrels_eng/ and download the file listed.
For topics, navigate to http://trec.nist.gov/data/topics_eng/index.html and download the file listed.
For indexes, save the listed file. They are approximately 225 Mb each.
| Conference | Queries | Topics | Index |
|---|---|---|---|
| TREC 1 | qrels.51-100.disk1.disk2.parts1-5.tar.gz | TREC-1 ad hoc & TREC-2 routing topics | Media:TREC_1_index.aspt |
| TREC 2 | qrels.101-150.disk1.disk2.parts1-5.tar.gz | TREC-2 ad hoc & TREC-3 routing topics | Media:TREC_1_index.aspt |
| TREC 3 | qrels.151-200.201-250.disks1-3.all.tar.gz | TREC-3 ad hoc topics | Media:TREC_1_index.aspt |
| TREC 4 | qrels.201-250.disk2.disk3.parts1-5.tar.gz | TREC-4 ad hoc topics | Media:TREC_4_index.aspt |
| TREC 5 | qrels.251-300.parts1-5.tar.gz | TREC-5 ad hoc topics | Media:TREC_5_index.aspt |
| TREC 6 | qrels.trec6.adhoc.parts1-5.tar.gz | TREC-6 ad hoc topics | Media:TREC_6_index.aspt |
| TREC 7 | qrels.trec7.adhoc.parts1-5.tar.gz | TREC-7 ad hoc and TREC-8 filtering topics | Media:TREC_7_index.aspt |
| TREC 8 | qrels.trec8.adhoc.parts1-5.tar.gz | TREC-8 ad hoc and small web topics | Media:TREC_7_index.aspt |
| WT2G | |||
| WT10G | http://trec.nist.gov/data/t9.web.html http://trec.nist.gov/data/t10.web.html | ||
| WT100G | |||
| DOTGOV | |||
| DOTGOV2 |
Note that TREC 1 collection was used for TRECs 1-3, and TREC 7 collection was used for TRECs 7-8.
Usage
For a more thorough introduction, see How to use indexes.
If you have index.aspt in the working directory:
~$ ./atire -QN:t -q topics.51-100 -a qrels.51-100 -s-p -l0
Otherwise, you can specify its location with -findex.

Copyright © 2011 ATIRE.ORG. ALL Rights Reserved