    LLM

    Scalable PDF Document Processing with DataChain and Unstructured.io
    Extract and parse text from documents and create vector embeddings in a scalable, distributed way (in fewer than 70 lines of code).
    • Tibor Mach
    • Sep 30, 2024 · 7 min read
    Enforcing JSON Outputs in Commercial LLMs
    The results of our tests on the structured outputs of Google Gemini Pro, Anthropic Claude, and OpenAI GPT. DataChain was used for evaluation.
    • Daniel Kharitonov
    • Sep 06, 2024 · 10 min read
    Announcing DataChain
    Introducing DataChain, a new open-source tool to curate and process unstructured data using local ML models and LLM calls.
    • Dmitry Petrov
    • Jul 23, 2024 · 4 min read
    Leveraging LLMs in Chatbots: The DVC Approach
    Read how DVC can optimize the development process for chatbots built on Large Language Models.
    • Ryan Turner
    • Sep 25, 2023 · 6 min read
    Fine-Tuning Large Language Models with a Production-Grade Pipeline
    This post describes a production ML pipeline for fine-tuning large language models using DVC, SkyPilot, HuggingFace Transformers, and quantization techniques.
    • Alex Kim
    • Sep 08, 2023 · 10 min read