|
http://www.ics.uci.edu/~mlearn/MLSummary.html
Machine Learning Repository from UCI. |
286 |
|
http://www.kdnuggets.com/datasets/index.html
Another hub listing datasets from many sites. |
216 |
|
http://www.cs.cmu.edu/~cil/v-images.html
A comprehensive list of image test sets from the (now defunct) Calibrated Image Lab at CMU. |
189 |
|
http://www.ldc.upenn.edu/
The Linguistic Data Consortium supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards. It licenses and supports corpora development in the USA. |
168 |
|
http://www.elda.org/sommaire.php
Evaluations and Language resources Distribution Agengy (ELDA), the European agency that distributes and licenses corpora for research and commercial use. |
228 |
|
http://www.kdnuggets.com/datasets/competitions.html
Datasets from Machine Learning competitions, including KDD cups. |
187 |
|
http://www-cvr.ai.uiuc.edu/ponce_grp/data/
Other Computer Vision Research from the Prof. Ponce research group at UIUC. |
201 |
|
http://barissumengen.com/edgeflow.html
This package contains about 13000 segmentations run on test and training images from Berkeley Segmentation Data Set (BSDS). |
188 |
|
http://www.vision.caltech.edu/html-files/archive.html
Object images from Caltech's computational vision group. |
185 |
|
http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html
Machine Learning datasets from MLnet. Add your own datasets here. Some spam links. |
255 |
|
http://www.cs.utoronto.ca/~delve/data/datasets.html
Collections of data for developing, evaluating, and comparing learning methods. |
360 |
|
http://www.cs.cmu.edu/~webkb/
The Web->KB dataset used in webpage categorization |
199 |
|
http://www.nd.edu/~oss/Data/data.html
SourceForge.net Research Data, includes historic and status statistics on approximately 100,000 projects and over 1 million registered users' activities at the project management web site. |
209 |
|
http://vision.ece.ucsb.edu/datasets/
Aerial vision datasets from UC Santa Barbera. |
212 |
|
http://www.statsci.org/datasets.html
Statistical Science datsets, both fully processed, ready-to-use as well as raw statistical source data. |
223 |
|
http://www.comp.nus.edu.sg/~rpnlpir/
The listing of natural language corpora at the National University of Singapore |
274 |