Manage Data Programmatically

Your Datasets, as Code.

Define, version, and manage your AI training data programmatically. Stop wrestling with files and start delivering high-quality datasets through a simple, powerful API.

Join waitlist

datasets.do

import { Dataset } from 'datasets.do';

// Define and register a new dataset schema
const imageCaptions = await Dataset.create({
  name: 'image-caption-pairs-v2',
  description: '1M image-caption pairs for model training.',
  schema: {
    imageUrl: { type: 'string', required: true },
    caption: { type: 'string', required: true },
    source: { type: 'string' }
  }
});

// Add new records via the API
await imageCaptions.addRecords([
  { imageUrl: 'https://cdn.do/img-1.jpg', caption: 'A photo of a cat on a couch.' },
  { imageUrl: 'https://cdn.do/img-2.jpg', caption: 'A photo of a boat on the water.' }
]);

Deliver economically valuable work

Frequently Asked Questions

Do Work. With AI.