Blogs about: Datasets

Featured Blog

Maintaining the distinction between metadata and content files: datasets (2 comments)

neilgodfrey wrote 3 weeks ago: Case study: an archival body responsible for cultural preservation Case study: An archival body has … more →

Tags: Metadata, CDWA, Cultural Heritage, dataset, EAD, ISAD, ISAD(G), Mods

Can I see some I.D., please?

Rich Parma wrote 1 month ago: After dealing with a couple of datasets in a row with missing or imperfect identifiers, I thought I … more →

Tags: Data Management, identifiers

Music Datasets for MIR

lgustavomartins wrote 2 months ago: Publicly available music datasets are like gold mines for anyone working in MIR. And such datasets b … more →

Tags: Mir, Music, Palco Principal, magnatagatune, Mirex, Creative Commons, ccmixter

Introduction to Datasets

ravivarmathumati wrote 3 months ago: Datasets store data in a disconnected cache. The structure of a dataset is similar to that of a re … more →

Tags: Introduction to Datasets

Save and load Dataset Objects from XML

Nick Masao wrote 4 months ago: I love the .NET framework for the simplicity it has brought to application development in so many wa … more →

Tags: Programming, readXml, writeXml

Amazon Web Services hosts DBpedia, Freebase data sets

kellyjoseph wrote 4 months ago: The Infochimps.org community played a part in pushing DBpedia and Freebase data sets to Amazon Web Se … more →

Tags: AWS, infochimps.org, machetEC2, dbpedia, freebase, Linked Data

Publicly available datasets from Amazon

Jeff wrote 4 months ago: Well dang. Amazon just made transportation databases, a Freebase database dump, the English version … more →

Tags: musings, Research, Amazon

New text resources available

aufrank wrote 5 months ago: Two new resources have recently become available that may be of interest to the NLP and Psycholingus … more →

Tags: corpus-based research, HLP lab, Statistics & Methodology, corpora, Research

The Asdrubal Cabrera Hall of Fame

mrflip wrote 5 months ago: Prompted by my friend’s skepticism that the ballplayer Milton Bradley is really so named, I … more →

Tags: something, asdrubal, Baseball, cabrera, coco crisp, first name, Hall Of Fame, Milton Bradley, mysql

Massive Scrape of Twitter's Friend Graph (25 comments)

mrflip wrote 6 months ago: UPDATE: We’ve taken the data down for the moment, at Twitter’s request. STAY CALM. They … more →

Tags: Main, big, bigdata, Data, dataset, Download, followers, Free, Friend

The Petabyte Age

Michael Chen wrote 7 months ago: Wired Magazine’s feature on the future of data. “Sensors everywhere. Infinite storage. … more →

Tags: 2008-2009, Research Resources, ubiquitous computing, wired, sensory architecture

Data-on-a-stick. Deepfried for extra goodness.

pos.thum.us wrote 8 months ago: For the 3TU Datacentrum we are busy setting up an infrastructure to store and preserve scientific da … more →

Tags: TUDelft, Work, 3TU, Data, RDF, xml

Many Eyes

BrendanB wrote 10 months ago: First, let me apologize for the lack of consistency of postings. A few months ago, Dan and I commit … more →

Tags: General, visualization

Troubleshooting the "Failed to enable constraints..." DataSet Error

sapientcoder wrote 11 months ago: At one time or another, many of us who have worked with DataSets in .NET have received this forebodi … more →

Tags: .NET, dataset

What Transformations Do Traditional Datasets Need To Be Put Through Before Migrating On-The-Cloud? (1 comment)

Peter Benza wrote 11 months ago: [Be First To Answer This Question.] … more →

Tags: Architecture, Data migration, Performance, cloudsets

What advantages do cloudsets have over traditional datasets stored in a corporate data warehouse? (1 comment)

Peter Benza wrote 11 months ago: [Be first to leave a comment for this question.] … more →

Tags: Business Culture, Access, functionality, Data Governance, Control Management, Performance, cloud-based solutions

How should data residing in a cloud be classified? (1 comment)

Peter Benza wrote 11 months ago: [Be first to leave a comment for this question.] … more →

Tags: Compliance, Data Governance, cloudsets

Infinite Monkeywrench hosted on GitHub

dhruvbansal wrote 1 year ago: Rejoice, you open-source orangutans, for the powerful, the weighty, the Infinite Monkeywrench is now … more →

Tags: infochimps.org, IMW, Git

Database Spotlight: Roper Center for Public Opinion

akelley wrote 1 year ago: In case you haven’t noticed the Database Spotlight on the GMU Libraries’ page, it’ … more →

Tags: Reference, Library, Polls, Surveys, Public Opinion

