Parquet vs ORC vs Avro vs Delta Lake
Photo by Viktor Talashuk on Unsplash
The big data world is full of various storage systems, heavily influenced by different file formats. These are key in nearly all data pipelines, allowing for efficient data
Navigating the waves of the web ocean
The Secret to Running LLMs on Consumer Hardware! Continue reading on Towards Data Science »
How to use a large language model to convert questions about a dataset into code that runs on-the-fly to deliver the answers, all web… Continue reading on Towards Data Science »
Let’s reverse-engineer “R1” and its Large Action Model Continue reading on Towards Data Science »
What tech layoffs taught me in 2023 Continue reading on Towards Data Science »
Transforming Backpropagation’s Complex Math into Manageable and Easy-to-Learn Bites Continue reading on Towards Data Science »
Taking a break from the generative AI hype around LLMs and foundation models, let’s explore how synthetic data created by more traditional generative AI models are set for mainstream adoption. Image generated by Arne Rustad using DALLE-3. Data is as valuable
From GeoJSON to UTM, these tools help map the world! Continue reading on Towards Data Science »
A Bonus Article for “The Book of Why” Series Continue reading on Towards Data Science »
When ReLU’s extrapolation capabilities are not enough Continue reading on Towards Data Science »