Unfolding the universe of possibilities..

Navigating the waves of the web ocean

The Other Side of Data Contracts: Awakening Consumer Responsibility

Drive the path to value creation by formalising the commitments of data consumers Generated with Midjourney Balancing the Field: The Dual Responsibilities in Data Contracts A football game where half the team is unaware they need to score would indeed

Type-Hinting DataFrames for Static Analysis and Runtime Validation

How StaticFrame Enables Comprehensive DataFrame Type Hints Photo by Author Since the advent of type hints in Python 3.5, statically typing a DataFrame has generally been limited to specifying just the type: def process(f: DataFrame) -> Series: … This is inadequate, as it

SCD2 — Semantics and Styles

How small differences can have a big impact Continue reading on Towards Data Science »

Seeing with Sound: Empowering the Visually Impaired with GPT-4V(ision) and Text-to-Speech…

Enhancing Visual Impairment Navigation: Integrating GPT-4V(ision) and TTS for Advanced Sensory Assistance Continue reading on Towards Data Science »

The World’s Smallest Data Pipeline Framework

A simple and fast data pipeline foundation with sophisticated functionality. Photo by Ana Lucia Cottone on Unsplash Data wrangling is perhaps the job that occupies the most time from Data Scientists. Data wrangling includes cleaning, transforming, and generally manipulating data from

The Creative, Occasionally Messy World of Textual Data

For several years, the intersection of text and data stayed (more or less) within the realm of natural language processing (NLP) — the wide range of machine learning tasks that leverage textual data for prediction, classification, and recommendation tools. The rise of

Custom Scoring Functions in Scikit-Learn

A deep dive into scoring functions for use in RandomizedSearchCV, GridSearchCV and cross_val_score Continue reading on Towards Data Science »

AI’s Proactive Role in Outsmarting Corruption in Government

Image by author, created with the assistance of DALL.E 3 Revolutionising Efforts with Generative Adversarial Networks and Synthetic Data Introduction The recent explosion of generative Artificial Intelligence (AI) models has focused the world on ethics, risks, and security concerns, and the

Advanced Dimensionality Reduction Models Made Simple

Learn how to efficiently apply state-of-the-art Dimensionality Reduction methods and boost your Machine Learning models. Continue reading on Towards Data Science »

Improving the Strava Training Log

Visualising the training patterns of marathon runners using Strava, Python, and Matplotlib. Continue reading on Towards Data Science »