    Iterative Blog

    Find DataChain and DVC news, findings, interesting reads, community takeaways, and deep dives into machine learning workflows, from data versioning and processing to model productionization.

    You Do the Math: Fine Tuning Multimodal Models (CLIP) to Match Cartoon Images to Joke Captions
    Learn how to fine-tune multimodal models like CLIP to match images to text captions.
    • Dave Berenbaum
    • Sep 12, 2024 • 8 min read
    Enforcing JSON Outputs in Commercial LLMs
    The results of our tests on the structured outputs of Google Gemini Pro, Anthropic Claude, and OpenAI GPT. DataChain was used for the evaluation.
    • Daniel Kharitonov
    • Sep 06, 2024 • 10 min read
    Dataset Factory - A Toolchain for Generative Computer Vision Datasets
    Learn about our latest approach to mastering your unstructured data and metadata.
    • Jeny De Figueiredo
    • Mar 25, 2024 • 1 min read
    Tutorial: Scalable and Distributed ML Workflows with DVC and Ray on AWS (Part 2)
    Need to set up DVC to work with a Ray cluster on AWS? This tutorial has you covered!
    • Mikhail Rozhkov
    • Mar 13, 2024 • 16 min read
    Tutorial: Scalable and Distributed ML Workflows with DVC and Ray (Part 1)
    This tutorial introduces you to integrating DVC (Data Version Control) with Ray, turning them into your go-to toolkit for creating automated, scalable, and distributed ML pipelines.
    • Mikhail Rozhkov
    • Mar 12, 2024 • 15 min read
    Running DVC on a SLURM cluster
    Learn how Exscientia uses DVC experiments on a cloud-deployed SLURM cluster to scale their ML experimentation.
    • Dom Miketa
    • Mar 11, 2024 • 8 min read