Datasets API

Datasets are tabular, image, text, spectral, time-series, or mixed-format data files linked to a lab’s experiments. Each dataset has a version history and can be published to Hugging Face via the lab’s publish pipeline.

List Datasets

labId (string, required): Convex ID of the lab.

curl "https://www.hubify.com/api/v1/datasets?labId=$LAB_ID" \
  -H "Authorization: Bearer $HUBIFY_TOKEN"

datasets (object[], required): All datasets for the lab.
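
The request above can be wrapped in a small helper. This is a minimal sketch, not an official client: `list_datasets` is a hypothetical function name, and it assumes `HUBIFY_TOKEN` is already exported. It uses curl's `-G` with `--data-urlencode` so the lab ID is URL-encoded into the query string, and `--fail` so HTTP 4xx/5xx responses produce a non-zero exit status.

```shell
# Sketch: list a lab's datasets and fail loudly on HTTP errors.
# list_datasets is an illustrative name, not part of the Hubify API.
list_datasets() {
  # $1: Convex ID of the lab
  # -G turns the --data-urlencode value into a query string on a GET request.
  # --fail makes curl exit non-zero on HTTP 4xx/5xx responses.
  curl --silent --show-error --fail -G "https://www.hubify.com/api/v1/datasets" \
    --data-urlencode "labId=$1" \
    -H "Authorization: Bearer $HUBIFY_TOKEN"
}

# Usage (performs a real request):
#   list_datasets "$LAB_ID"
```

On success the helper prints the response body, which carries the `datasets` array described above.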

Create Dataset

labId (string, required): Convex lab ID.
name (string, required): Dataset name.
format (string, required): File format (e.g., parquet, fits).
description (string): Human-readable description.
recordCount (number): Row/sample count.
sizeBytes (number): File size in bytes.
datasetType (string): Type classification. One of tabular, image, text, spectral, timeseries, mixed.
experimentId (string): Convex experiment ID to link.

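
Because `datasetType` accepts only a fixed set of values, a client may want to check it before sending the request. The sketch below is a hypothetical client-side check (`is_valid_dataset_type` is an illustrative name). How the server responds to an unrecognized value is not documented here, so this only mirrors the list of allowed values given above.

```shell
# Sketch: client-side check of datasetType against the documented values.
# is_valid_dataset_type is an illustrative name, not part of the Hubify API.
is_valid_dataset_type() {
  case "$1" in
    tabular|image|text|spectral|timeseries|mixed) return 0 ;;
    *) return 1 ;;
  esac
}

# Usage:
#   is_valid_dataset_type "spectral" || echo "unsupported datasetType" >&2
```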
curl -X POST https://www.hubify.com/api/v1/datasets \
  -H "Authorization: Bearer $HUBIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "labId": "'"$LAB_ID"'",
    "name": "anomaly-catalog-v1.3",
    "format": "fits",
    "datasetType": "spectral",
    "recordCount": 37300000,
    "description": "Multi-survey anomaly catalog with 37.3M galaxy spectra"
  }'

id (string, required): New dataset Convex ID.
version (string): Initial version (v1.0).
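
When the request body depends on shell variables, it can help to assemble the JSON separately from the curl call. The following is a minimal sketch under stated assumptions: `dataset_payload` and `create_dataset` are hypothetical helper names, only a subset of the documented fields is included, and the values must not contain double quotes, backslashes, or control characters, because `printf` does not JSON-escape them.

```shell
# Sketch: build the Create Dataset body from arguments, then POST it.
# dataset_payload and create_dataset are illustrative names, not part of the API.
# Assumption: argument values need no JSON escaping (no quotes or backslashes).
dataset_payload() {
  # args: labId name format datasetType recordCount
  printf '{"labId":"%s","name":"%s","format":"%s","datasetType":"%s","recordCount":%d}\n' \
    "$1" "$2" "$3" "$4" "$5"
}

create_dataset() {
  # --data @- tells curl to read the request body from standard input.
  dataset_payload "$@" |
    curl --silent --show-error --fail -X POST "https://www.hubify.com/api/v1/datasets" \
      -H "Authorization: Bearer $HUBIFY_TOKEN" \
      -H "Content-Type: application/json" \
      --data @-
}

# Usage (performs a real request):
#   create_dataset "$LAB_ID" "anomaly-catalog-v1.3" "fits" "spectral" 37300000
```

Values are passed as arguments, so shell variables such as `$LAB_ID` expand before the body is assembled. For values that may need escaping, a JSON-aware tool is a safer choice than `printf`.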