arrow Home arrow Web Links
Search
PREMIA
Home
News
Events and Activities
Downloads
Web Links
Link to PREMIA!
Call for Papers
Job Openings Job Openings
Forum Forum
Membership
Member's Area
Registration with ROS
Society Constitution
Online Registration Guide
IAPR Newsletter
PREMIA Newsletter
Contact The Committee Board
Advertisement

Datasets and Corpora

  Web Link Hits
     http://www.ics.uci.edu/~mlearn/MLSummary.html
Machine Learning Repository from UCI.
286
     http://www.kdnuggets.com/datasets/index.html
Another hub listing datasets from many sites.
216
     http://www.cs.cmu.edu/~cil/v-images.html
A comprehensive list of image test sets from the (now defunct) Calibrated Image Lab at CMU.
189
     http://www.ldc.upenn.edu/
The Linguistic Data Consortium supports language-related education, research and technology development by creating and sharing linguistic resources: data, tools and standards. It licenses and supports corpora development in the USA.
168
     http://www.elda.org/sommaire.php
Evaluations and Language resources Distribution Agengy (ELDA), the European agency that distributes and licenses corpora for research and commercial use.
228
     http://www.kdnuggets.com/datasets/competitions.html
Datasets from Machine Learning competitions, including KDD cups.
187
     http://www-cvr.ai.uiuc.edu/ponce_grp/data/
Other Computer Vision Research from the Prof. Ponce research group at UIUC.
201
     http://barissumengen.com/edgeflow.html
This package contains about 13000 segmentations run on test and training images from Berkeley Segmentation Data Set (BSDS).
188
     http://www.vision.caltech.edu/html-files/archive.html
Object images from Caltech's computational vision group.
185
     http://www.mlnet.org/cgi-bin/mlnetois.pl/?File=datasets.html
Machine Learning datasets from MLnet. Add your own datasets here. Some spam links.
255
     http://www.cs.utoronto.ca/~delve/data/datasets.html
Collections of data for developing, evaluating, and comparing learning methods.
360
     http://www.cs.cmu.edu/~webkb/
The Web->KB dataset used in webpage categorization
199
     http://www.nd.edu/~oss/Data/data.html
SourceForge.net Research Data, includes historic and status statistics on approximately 100,000 projects and over 1 million registered users' activities at the project management web site.
209
     http://vision.ece.ucsb.edu/datasets/
Aerial vision datasets from UC Santa Barbera.
212
     http://www.statsci.org/datasets.html
Statistical Science datsets, both fully processed, ready-to-use as well as raw statistical source data.
223
     http://www.comp.nus.edu.sg/~rpnlpir/
The listing of natural language corpora at the National University of Singapore
274