Machine Learning
As GenAI Fever Fades - Time to Prioritize Robust Engineering Over Overblown Promises
Improved Engineering and Data Management will be what carries GenAI into maturity
Dmitry Petrov
Oct 23, 2024 • 3 min read
Scalable PDF Document Processing with DataChain and Unstructured.io
Extract and parse text from documents and create vector embeddings in a scalable, distributed way (in fewer than 70 lines of code).
Tibor Mach
Sep 30, 2024 • 7 min read
Post-modern AI Data Stack
How and Why Generative AI will change the modern data stack.
Daniel Kharitonov
Sep 24, 2024 • 7 min read
You Do the Math: Fine Tuning Multimodal Models (CLIP) to Match Cartoon Images to Joke Captions
Learn how to fine-tune multimodal models like CLIP to match images to text captions.
Dave Berenbaum
Sep 12, 2024 • 9 min read
Enforcing JSON Outputs in Commercial LLMs
The results of our tests on the structured outputs of Google Gemini Pro, Anthropic Claude, and OpenAI GPT, with DataChain used for evaluation.
Daniel Kharitonov
Sep 06, 2024 • 10 min read
Announcing DataChain
Introducing DataChain, a new open-source tool to curate and process unstructured data using local ML models and LLM calls.
Dmitry Petrov
Jul 23, 2024 • 4 min read
Dataset Factory - A Toolchain for Generative Computer Vision Datasets
Learn about our latest approach to mastering your unstructured data and metadata.
Jeny De Figueiredo
Mar 25, 2024 • 1 min read
Tutorial: Scalable and Distributed ML Workflows with DVC and Ray on AWS (Part 2)
Need to set up DVC to work with a Ray cluster on AWS? This tutorial has you covered!
Mikhail Rozhkov
Mar 13, 2024 • 16 min read
Tutorial: Scalable and Distributed ML Workflows with DVC and Ray (Part 1)
This tutorial introduces you to integrating DVC (Data Version Control) with Ray, turning them into your go-to toolkit for creating automated, scalable, and distributed ML pipelines.
Mikhail Rozhkov
Mar 12, 2024 • 15 min read