Unfolding the universe of possibilities..

Whispers from the digital wind, hang tight..

How to perform anomaly detection with the LOF Algorithm

An introduction to performing outlier detection with the Local Outlier Factor (LOF) algorithm. Continue reading on Towards Data Science »

Step by Step Basics: Code Autodocumentation

Generate flawless documentation of Python code using Sphinx Photo by Dustin Humes on UnsplashYou can build beautiful, standardised and stylised documentation using just the docstrings in a few simple steps. A Data Scientist holds many responsibilities when working on a project, and one

Deploy A Production-Ready Streamlit App with Cloud Run and Cloud Build

How to ship containerized applications on a serverless architecture and over a CICD pipeline. Continue reading on Towards Data Science »

Process Pandas DataFrames with a Large Language Model

Seamless Integration of Python, Pandas, and LLM Continue reading on Towards Data Science »

Automated Detection of Data Quality Issues

This article is the second in a series about cleaning data using Large Language Models (LLMs), with a focus on identifying errors in tabular data sets. The sketch outlines the methodology we’ll explore in this article, which focuses on evaluating the

Entity Type Prediction with Relational Graph Convolutional Network (PyTorch)

This post proposes a Python setup for entity type prediction on heterogenous graphs, using the Relational Graph Convolutional Network (R-GCN). The setup uses the RGCNConv module from PyTorch. The code discussed in this post can be found on GitHub. Before

How to quantify customer problems for prioritization using churn survey

Understanding users’ needs and pain points is a critical component of business success. Churn surveys, a specific type of survey designed for customers who have stopped using a service, are a treasure trove of customer insights. However, the true power

7 Regrets From My First Year As A Data Scientist

What I wish I knew before I started my first data science job Continue reading on Towards Data Science »

Exploring Brazil’s National Accounts through a Dashboard

Implementation details and analytical possibilities Photo by Dominik Lückmann on Unsplash At the end of the first week of March 2024, the news revealed that Brazil’s GDP grew by almost 3% in 2023 compared to the previous year, reaching a total

Image generation with diffusion models using Keras and TensorFlow

Using Diffusion to generate images You must have heard of Dall-E 2. Published by Open AI, which is a model that generates realistic looking images from a given text prompt. You can check out a smaller version of the model here. Ever