SNORKEL: THE SYSTEM FOR

Programmatically Building and Managing Training Data

Pattern 3

Build

Build Training Sets Programmatically

Labeling and managing training datasets by hand is one of the biggest bottlenecks in machine learning. In Snorkel, write heuristic functions to do this programmatically instead!

Get Started!

Model

Model Weak Supervision

Programmatic or weak supervision sources can be noisy and correlated. Snorkel uses novel, theoretically-grounded unsupervised modeling techniques to automatically clean and integrate them.

Read More!

Train

Train Modern ML Models

Snorkel outputs clean, confidence-weighted training datasets that easily plug into any modern machine learning framework.

Tutorials

start in minutes

Quickstart

# For pip users
pip install snorkel

# For conda users
conda install snorkel -c conda-forge

GitHub Get Started

USERS & SPONSORS

Logos
SEE ALL BLOGS