No results found for query ""
    Search by

    Heavy Data

    Parquet Is Great for Tables, Terrible for Video - Here's Why
    Parquet is great for tables, terrible for images and video. Here's why shoving heavy data into columnar formats is the wrong approach - and what we should build instead. Hint: it's not about the formats, it's about the metadata.
    • Dmitry Petrov
    • Sep 03, 20255 min read
    From Big Data to Heavy Data: Rethinking the AI Stack
    LLMs can finally interpret unstructured video, audio, and documents — but they can't do it alone. This post introduces the concept of heavy data and explores how modern teams build multimodal pipelines to turn it into AI-ready data.
    • Dmitry Petrov
    • Jun 09, 20253 min read