No results found for query ""
    Search by

    Product updates, news, tutorials, integrations, and deep dives.

    Tutorial

    Scalable PDF Document Processing with DataChain and Unstructured.io
    Extract and parse text from documents and create vector embeddings in a scalable and distributed way (and less than 70 lines of code).
    • Tibor Mach
    • Sep 30, 20247 min read
    You Do the Math: Fine Tuning Multimodal Models (CLIP) to Match Cartoon Images to Joke Captions
    Learn how to fine tune multimodal models like CLIP to match images to text captions.
    • Dave Berenbaum
    • Sep 12, 20249 min read
    Tutorial: Scalable and Distributed ML Workflows with DVC and Ray on AWS (Part 2)
    Need to setup DVC to work with Ray Cluster on AWS? This tutorial has you covered!
    • Mikhail Rozhkov
    • Mar 13, 202416 min read
    Tutorial: Scalable and Distributed ML Workflows with DVC and Ray (Part 1)
    This tutorial introduces you to integrating DVC (Data Version Control) with Ray, turning them into your go-to toolkit for creating automated, scalable, and distributed ML pipelines.
    • Mikhail Rozhkov
    • Mar 12, 202415 min read
    Running DVC on a SLURM cluster
    Learn how Exscientia uses DVC experiments on a cloud-deployed SLURM cluster to scale their ML experimentation.
    • Dom Miketa
    • Mar 11, 20248 min read
    Leveraging LLMs in Chatbots: The DVC Approach
    Read how DVC can optimize the development process for chatbots built on Large Language Models.
    • Ryan Turner
    • Sep 25, 20236 min read
    Fine-Tuning Large Language Models with a Production-Grade Pipeline
    This post describes a production ML pipeline for fine-tuning large language models using DVC, SkyPilot, HuggingFace Transformers, and quantization techniques.
    • Alex Kim
    • Sep 08, 202310 min read
    Automate model deployment to Amazon SageMaker with the DVC Model Registry
    DVC provides a Git-based mechanism to automate model deployment from an intuitive web UI.
    • Tapa Dipti Sitaula
    • Aug 30, 20236 min read
    Managing OpenFOAM Physical Simulations with DVC, CML, and Studio (Part 2)
    In this second part, we discuss how to utilize cloud computing resources and visualize simulation data with CML and Iterative Studio.
    • Mikhail Rozhkov
    • May 10, 20236 min read