Latest Version: v2.0.0 - Major refactoring with improved architecture, enhanced payload structure, and new features. See details below.

Version 2.0.0 (November 2025)

Breaking Changes - This release includes significant API changes. Please review the migration notes carefully.

Architecture & Infrastructure

Complete restructuring from monolithic 2,000+ line files into organized, maintainable modules.
New Structure:
  • labellerr/core/annotation_templates/ - Template management
  • labellerr/core/autolabel/ - Model training features
  • labellerr/core/connectors/ - S3 and GCS connections
  • labellerr/core/datasets/ - Dataset CRUD operations
  • labellerr/core/exports/ - Export handling with polling
  • labellerr/core/projects/ - Project management
  • labellerr/core/schemas/ - Pydantic validation models
  • labellerr/core/users/ - User management
Benefits:
  • Separation of concerns with single responsibility per module
  • Factory pattern for automatic type-specific object instantiation
  • Easier maintenance and extensibility
  • Better code discoverability
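The factory pattern mentioned above can be sketched as follows. This is an illustration only, not the SDK's actual implementation; the class names and dispatch key are assumptions based on the data types the SDK supports.

```python
# Illustrative sketch of the factory pattern: one entry point returns the
# type-specific dataset class for a given data_type. Class names here are
# stand-ins, not the SDK's real classes.
class ImageDataset: ...
class VideoDataset: ...
class AudioDataset: ...
class DocumentDataset: ...

_DATASET_CLASSES = {
    "image": ImageDataset,
    "video": VideoDataset,
    "audio": AudioDataset,
    "document": DocumentDataset,
}

def dataset_factory(data_type: str):
    """Instantiate the dataset class registered for data_type."""
    try:
        return _DATASET_CLASSES[data_type]()
    except KeyError:
        raise ValueError(f"unsupported data_type: {data_type!r}") from None

dataset = dataset_factory("image")  # an ImageDataset instance
```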
Major Improvement: All function parameters now use Pydantic models for validation instead of plain dictionaries.
Old Approach:
# Plain dictionary - no validation
payload = {"project_name": "My Project", "data_type": "image", ...}
New Approach:
from labellerr.core.schemas import CreateProjectParams, RotationConfig

# Type-safe with automatic validation
params = CreateProjectParams(
    project_name="My Project",
    data_type="image",
    rotations=RotationConfig(
        annotation_rotation_count=1,
        review_rotation_count=1,
        client_review_rotation_count=1
    )
)
Benefits:
  • Automatic validation before API calls
  • IDE autocomplete and type checking
  • Clear, self-documenting schema fields
  • Early error detection
  • Organized schemas by domain (autolabel, connectors, datasets, exports, files, projects, users)
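The "early error detection" point can be seen in a self-contained sketch. The model below is a hand-rolled stand-in for the SDK's RotationConfig, using Pydantic directly: invalid input is rejected locally, before any HTTP request is made.

```python
# Minimal illustration of validation before API calls. RotationConfigSketch
# is a stand-in, not the SDK's actual schema class.
from pydantic import BaseModel, ValidationError

class RotationConfigSketch(BaseModel):
    annotation_rotation_count: int
    review_rotation_count: int
    client_review_rotation_count: int

errors = None
try:
    RotationConfigSketch(
        annotation_rotation_count="three",  # wrong type: rejected locally
        review_rotation_count=1,
        client_review_rotation_count=1,
    )
except ValidationError as exc:
    errors = exc.errors()  # structured error details, no HTTP request made
```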
New Workflow: Project creation now follows an explicit, modular pipeline for better control and reusability.
Old Approach:
# Everything in one call - datasets created inline
project = create_project(
    client=client,
    payload={
        "project_name": "My Project",
        "folder_to_upload": "./images",
        "annotation_template_id": "template_123",
        ...
    }
)
New Approach:
# Step 1: Create dataset
dataset = create_dataset_from_local(
    client=client,
    dataset_config=DatasetConfig(
        dataset_name="My Dataset",
        data_type="image"
    ),
    folder_to_upload="./images"
)
dataset.status()  # Wait for completion

# Step 2: Get annotation template
template = LabellerrAnnotationTemplate(client, "template_123")

# Step 3: Create project with explicit dependencies
project = create_project(
    client=client,
    params=CreateProjectParams(...),
    datasets=[dataset],
    annotation_template=template
)
Why This Change?
  • Explicit Dependencies: Datasets and templates validated before project creation
  • Better Error Handling: Issues caught earlier in the pipeline
  • Reusability: Same dataset can be attached to multiple projects
  • Clearer Flow: Each step is explicit, testable, and maintainable
  • Type Safety: Pydantic validation at each step

New Features

Export annotations directly to cloud storage with automatic status polling:
from labellerr.core.schemas import CreateExportParams

export = project.create_export(
    export_config=CreateExportParams(
        export_name="Production Export",
        export_format="json",
        connection_id="s3_conn_123",
        export_destination="s3",
        export_folder_path="bucket/exports/"
    )
)
# Automatically polls until completion
final_status = export.status()
Features:
  • Direct export to AWS S3
  • Automatic polling mechanism
  • Multiple export formats (JSON, CSV, COCO JSON)
  • Status filtering support
Train custom models on your annotated data with configurable hyperparameters:
from labellerr.core.autolabel import LabellerrAutoLabel
from labellerr.core.schemas import TrainingRequest, Hyperparameters

autolabel = LabellerrAutoLabel(client=client)
job = autolabel.train(
    training_request=TrainingRequest(
        model_id="yolov11",
        hyperparameters=Hyperparameters(
            epochs=100,
            batch_size=16,
            learning_rate=0.001
        ),
        job_name="Custom Training"
    )
)
Features:
  • Train YOLOv11 and other models
  • Configurable hyperparameters
  • Job status tracking
  • List all training jobs
Split Dataset Creation:
  • create_dataset_from_local() - For local file/folder uploads
  • create_dataset_from_connection() - For cloud storage connections
Auto-Pagination:
# Iterate through all datasets automatically
for dataset in list_datasets(client, datatype="image", scope="client", page_size=-1):
    process(dataset)
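Under the hood, page_size=-1 can be understood as a generator that keeps requesting pages until the server runs out. The sketch below mimics that idea with a fake fetch_page; it is not the SDK's internal implementation.

```python
# Illustrative auto-pagination generator. fetch_page is a stand-in for the
# SDK's per-page API call, backed here by an in-memory list.
def fetch_page(page_number, page_size=2):
    all_items = ["ds1", "ds2", "ds3", "ds4", "ds5"]  # pretend server data
    start = page_number * page_size
    return all_items[start:start + page_size]

def iter_all(page_size=2):
    """Yield items page by page until the server returns an empty page."""
    page_number = 0
    while True:
        page = fetch_page(page_number, page_size)
        if not page:
            return
        yield from page
        page_number += 1

items = list(iter_all())  # every item, no manual page bookkeeping
```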
Additional Features:
  • New delete_dataset() function
  • Improved sync operations for AWS S3 and GCS
  • Better status tracking with polling
  • Type-specific dataset classes (Image, Video, Audio, Document)
Dedicated module for structured annotation template creation:
from labellerr.core.annotation_templates import create_annotation_guideline
from labellerr.core.schemas import CreateTemplateParams, AnnotationQuestion

template = create_annotation_guideline(
    client=client,
    params=CreateTemplateParams(
        data_type="image",
        template_name="Object Detection Template",
        questions=[
            AnnotationQuestion(
                option_type="BoundingBox",
                question_text="Identify objects",
                options=[...]
            )
        ]
    )
)
Features:
  • Type-safe question definitions
  • Reusable templates across projects
  • Multiple question types (BoundingBox, polygon, radio, etc.)
  • Automatic validation
Dedicated Connection Classes:
  • LabellerrS3Connection - AWS S3 operations
  • LabellerrGCSConnection - Google Cloud Storage operations
Features:
  • Connection testing before creation
  • Support for import and export types
  • Better error handling
  • Type-safe parameters
Example:
from labellerr.core.connectors import LabellerrS3Connection
from labellerr.core.schemas import AWSConnectionParams

# Test connection before creating
test_result = LabellerrS3Connection.test_connection(client, params)

# Create connection with type safety
connection = LabellerrS3Connection.create_connection(
    client=client,
    params=AWSConnectionParams(
        aws_access_key="...",
        aws_secrets_key="...",
        s3_path="s3://bucket/path/",
        connection_type="export",
        name="Production S3"
    )
)

Breaking Changes

Project creation: the single inline call is replaced by the explicit dataset → template → project pipeline.
Old approach:
project = create_project(
    client=client,
    payload={
        "project_name": "My Project",
        "folder_to_upload": "./images",
        "annotation_template_id": "template_123",
        ...
    }
)
New approach:
# Step 1: Create dataset
dataset = create_dataset_from_local(
    client=client,
    dataset_config=DatasetConfig(
        dataset_name="My Dataset",
        data_type="image"
    ),
    folder_to_upload="./images"
)
dataset.status()  # Wait for completion

# Step 2: Get annotation template
template = LabellerrAnnotationTemplate(client, "template_123")

# Step 3: Create project
project = create_project(
    client=client,
    params=CreateProjectParams(...),
    datasets=[dataset],
    annotation_template=template
)
Why? Explicit dependencies, better error handling, and dataset reusability.
Dataset creation: the generic create_dataset() is split into source-specific functions.
Old:
dataset = create_dataset(
    client=client,
    dataset_config={...},
    folder_to_upload="./images"
)
New:
from labellerr.core.datasets import create_dataset_from_local
from labellerr.core.schemas import DatasetConfig

dataset = create_dataset_from_local(
    client=client,
    dataset_config=DatasetConfig(
        dataset_name="My Dataset",
        data_type="image"
    ),
    folder_to_upload="./images"
)
Update imports from root to labellerr.core.*:
# Core imports remain backward compatible
from labellerr import LabellerrClient

# New modular imports
from labellerr.core.datasets import create_dataset_from_local, LabellerrDataset
from labellerr.core.projects import create_project, LabellerrProject
from labellerr.core.schemas import DatasetConfig, CreateProjectParams
from labellerr.core.connectors import LabellerrS3Connection

Additional Improvements

  • Connection Pooling: Improved HTTP connection management for better performance
  • Error Handling: Enhanced error messages with automatic retry logic
  • Polling Mechanism: Smart status tracking for long-running operations (exports, datasets)
  • Testing Infrastructure: Comprehensive unit and integration test coverage
  • Documentation: Improved code examples and API documentation
  • Type-Specific Classes: Specialized methods for Image, Video, Audio, and Document types
  • Performance: Optimized request handling and data transfer
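The polling mechanism for long-running operations can be approximated by a loop like the one below. The intervals, backoff, and terminal state names are assumptions for illustration, not the SDK's actual internals.

```python
# Generic polling loop with exponential backoff, illustrating the idea behind
# the SDK's status tracking for exports and dataset creation.
import time

def poll_until_done(check_status, interval=1.0, max_interval=30.0, timeout=600.0):
    """Call check_status() until it returns a terminal state or timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        status = check_status()
        if status in ("completed", "failed"):
            return status
        time.sleep(interval)
        interval = min(interval * 2, max_interval)  # back off between checks
    raise TimeoutError("operation did not finish before the timeout")

# Usage with a fake status source standing in for export.status() checks:
states = iter(["queued", "running", "completed"])
result = poll_until_done(lambda: next(states), interval=0.01)
```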

Migration Notes

  1. Update SDK: pip install --upgrade labellerr
  2. Update imports: Change to new labellerr.core.* modules
  3. Refactor project creation: Follow new 3-step process (dataset → template → project)
  4. Use Pydantic schemas: Replace dict payloads with schema models
  5. Test thoroughly: Validate all SDK operations in dev/staging before production
Need help migrating? See our detailed migration guide or contact [email protected]
GitHub Release: v2.0.0

Version 1.0.1 (September 2025)

Improvements

  • New project creation flow with updated payload structure
  • Better validation for annotation_guide and annotation_template_id
  • "created_by" email now included automatically during project creation
  • Modularized SDK structure for easier maintenance
  • Automated release pipeline added via release_manager
  • Enhanced export status handling (polling + download URL fetching)

Bug Fixes

  • Fixed check_export_status inconsistencies
  • Improved error handling for invalid project creation payloads
  • Corrected edge-case failures in export polling
  • Minor fixes in annotation guide validation logic

Branch Strategy

  • main → production-ready releases
  • develop → pre-release testing versions
GitHub Release: https://github.com/Labellerr/SDKPython/releases/tag/v1.0.1

Version 1.0.0 (December 2024)

Initial Release

First public release of the Labellerr Python SDK.

Core Features

  • Client Initialization: Authenticate using LabellerrClient
  • Project Management: Create and configure annotation projects
  • Dataset Operations: Manage datasets across image, video, audio, text, and documents
  • Pre-annotations: Upload annotations (sync & async)
  • Export Functionality: Export annotations in json, coco_json, csv, png
  • Retrieval APIs: Fetch all projects and datasets
  • Error Handling: LabellerrError for managed exception handling
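The managed-exception pattern above looks roughly like this. To keep the sketch self-contained, LabellerrError is stood in by a local class and the failing SDK call is simulated; in real code you would import the exception from the SDK.

```python
# Self-contained sketch of the LabellerrError handling pattern. Both the
# local LabellerrError class and fetch_projects() are illustrative stand-ins.
class LabellerrError(Exception):
    """Stand-in for the SDK's managed exception type."""

def fetch_projects():
    # Simulates an SDK call (e.g. project retrieval) failing server-side.
    raise LabellerrError("invalid API key")

try:
    projects = fetch_projects()
except LabellerrError as err:
    message = str(err)  # inspect the SDK's error message and recover
```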

Core Modules

  • LabellerrClient — main API client
  • Project operations (initiate_create_project)
  • Pre-annotation upload methods
  • Local export utilities
  • Dataset retrieval helpers
GitHub Release: https://github.com/Labellerr/SDKPython/releases/tag/v1

Support & Resources


Semantic Versioning: We follow SemVer. Breaking changes increment the major version (e.g., 1.x → 2.0).