by
Iterative
Why DataChain?
Blog
Docs
Company
Learn
Pricing
Sign up
Sign in
Open main menu
All
Products
Company
No results found for query ""
Search by
LLM
Scalable PDF Document Processing with DataChain and Unstructured.io
Extract and parse text from documents and create vector embeddings in a scalable and distributed way (and less than 70 lines of code).
Tibor Mach
Sep 30, 2024 • 7 min read
Enforcing JSON Outputs in Commercial LLMs
The results of our tests on the structured outputs of Google Gemini Pro, Anthropic Claude, and OpenAI GPT. DataChain used for evaluation.
Daniel Kharitonov
Sep 06, 2024 • 10 min read
Announcing DataChain
Introducing DataChain - a new open-source tool to curate and process unstructured data using local ML models, and LLM calls.
Dmitry Petrov
Jul 23, 2024 • 4 min read
Leveraging LLMs in Chatbots: The DVC Approach
Read how DVC can optimize the development process for chatbots built on Large Language Models.
Ryan Turner
Sep 25, 2023 • 6 min read
Fine-Tuning Large Language Models with a Production-Grade Pipeline
This post describes a production ML pipeline for fine-tuning large language models using DVC, SkyPilot, HuggingFace Transformers, and quantization techniques.
Alex Kim
Sep 08, 2023 • 10 min read
Ready to get started?
Start for free
Contact us
Book a demo or explore use cases
Explore our
open source tools