Unfolding the universe of possibilities..

Navigating the waves of the web ocean

Comparing Performance of Big Data File Formats: A Practical Guide

Parquet vs ORC vs Avro vs Delta Lake Photo by Viktor Talashuk on Unsplash The big data world is full of various storage systems, heavily influenced by different file formats. These are key in nearly all data pipelines, allowing for efficient data

The Most Simple Way to Set Up ChatGPT Locally

The Secret to Running LLMs on Consumer Hardware! Continue reading on Towards Data Science »

Exploring data analysis via natural language — approach 1

How to use a large language model to convert questions about a dataset into code that runs on-the-fly to deliver the answers, all web… Continue reading on Towards Data Science »

Rabbit’s New AI Device Uses Apps On Your Behalf to “Do Anything” — But How Exactly Does It Work?

Let’s reverse-engineer “R1” and its Large Action Model Continue reading on Towards Data Science »

How to Make Yourself More Layoff-Proof as a Data Scientist

What tech layoffs taught me in 2023 Continue reading on Towards Data Science »

Courage to Learn ML: Explain Backpropagation from Mathematical Theory to Coding Practice

Transforming Backpropagation’s Complex Math into Manageable and Easy-to-Learn Bites Continue reading on Towards Data Science »

Adaption of Generative Methods for Anonymization will Revolutionize Data Sharing and Privacy

Taking a break from the generative AI hype around LLMs and foundation models, let’s explore how synthetic data created by more traditional generative AI models are set for mainstream adoption. Image generated by Arne Rustad using DALLE-3. Data is as valuable

The Language of Maps: A Guide to Geospatial Data Formats and Coordinates

From GeoJSON to UTM, these tools help map the world! Continue reading on Towards Data Science »

How is Causal Inference Different in Academia and Industry?

A Bonus Article for “The Book of Why” Series Continue reading on Towards Data Science »

Neural Networks For Periodic Functions

When ReLU’s extrapolation capabilities are not enough Continue reading on Towards Data Science »