# Create Dataset Create a new dataset. Training Version: The training_version parameter determines the dataset's compatibility and structure: * Legacy (max, light, 3.2, 2.3): Uses text captions and caption_prefix. Compatible with each other. * FIBO (fibo): Uses JSON structured data (visual_schema). caption_prefix is null. Defaults & constraints: * training_version defaults to max (will change to fibo in future updates). * training_version is immutable after creation. Project Compatibility & Automatic Assignment: When creating a dataset, the system validates compatibility with the parent Project: 1. Automatic Assignment: If the Project's training_version is null, it will automatically inherit the training_version of this new dataset. The Project will then be locked to this version family. 2. Validation: If the Project already has a training_version set: * FIBO Projects: Can only contain fibo datasets. * Legacy Projects: Can contain any legacy dataset (max, light, 3.2, 2.3). * Mixing Forbidden: You cannot create a fibo dataset in a legacy project, or vice versa. Completion Requirements: * Legacy: Minimum 1 image required to mark as completed. * FIBO: Minimum 5 images required to mark as complete Upload types: * Basic upload type: Supports up to 200 images, uploading image files * Advanced upload type: Supports up to 5000 images, uploading a zip file Endpoint: POST /tailored-gen/datasets ## Header parameters: - `api_token` (string, required) ## Request fields (application/json): - `project_id` (integer) Associated project ID (required) - `name` (string) Dataset name (required) - `training_version` (string) Specifies the foundation model version this dataset is intended for. * fibo: Uses JSON structured data (visual_schema). Requires generating a schema before uploading images. * max (default), light, 3.2, 2.3: Legacy versions using text captions and caption_prefix. Enum: "max", "light", "3.2", "2.3", "fibo" - `upload_type` (string) The method used to upload images to the dataset. 'basic' is the default Enum: "basic", "advanced" ## Response 201 fields (application/json): - `id` (integer) Unique identifier for the dataset - `project_id` (integer) Associated project ID - `name` (string) Dataset name - `training_version` (string) The foundation model version this dataset targets (e.g., 'fibo', 'max'). Enum: "max", "light", "3.2", "2.3", "fibo" - `caption_prefix` (string,null) Text prepended to captions. * For training_version = max/light/3.2/2.3: Required string. * For training_version = fibo: Null. - `visual_schema` (string,null) A string containing the JSON structure representing the visual backbone. * For training_version = fibo: Required string (must be generated via /generate_visual_schema). * For training_version = max/light/3.2/2.3: Null. - `status` (string) Status of the dataset Enum: "draft" - `captions_update_status` (string) Status of captions update process Enum: "empty", "in_progress", "completed" - `upload_type` (string) Upload type Enum: "basic", "advanced" - `captions_source` (string) The source of the captions. For 'basic' datasets, this is a null. For 'advanced' datasets, this indicates if captions were generated 'automatic' or provided 'manual'. this value is determined during bulk-upload Enum: "automatic", "manual" - `created_at` (string) Timestamp when the dataset was created - `updated_at` (string) Timestamp when the dataset was last updated ## Response 400 fields (application/json): - `errors` (array) - `errors.msg` (string) A human-readable description of the error. - `errors.field_name` (string) The name of the field that caused the error. - `errors.field_input` (string) The input value of the field that caused the error. ## Response 401 fields ## Response 404 fields ## Response 429 fields ## Response 500 fields