Scientific publication
T. M. Lange, M. Gültas, A. O. Schmitt & F. Heinrich (2025). optRF: Optimising random forest stability by determining the optimal number of trees. BMC bioinformatics, 26(1), 95. Follow this LINK to the original publication.
Random Forest — A Powerful Tool for Anyone Working With Data
What is Random Forest?
Have you ever wished you…
Introduction
Recent large language models (LLMs) — such as OpenAI’s o1/o3, DeepSeek’s R1 and Anthropic’s Claude 3.7 — demonstrate that allowing the model to think deeper and longer at test time can significantly enhance model’s reasoning capability. The core approach underlying their deep thinking capability is called chain-of-thought (CoT), where the model iteratively generates intermediate…

Which Outcome Matters? Here is a common scenario : An A/B test was conducted, where a random sample of units (e.g. customers) were selected for a campaign and they received Treatment A. Another sample was selected to receive Treatment B. “A” could be a communication or offer and “B” could be no communication or no…

In the world of machine learning, we obsess over model architectures, training pipelines, and hyper-parameter tuning, yet often overlook a fundamental aspect: how our features live and breathe throughout their lifecycle. From in-memory calculations that vanish after each prediction to the challenge of reproducing exact feature values months later, the way we handle features can…

Why distributed tracing is the key to resolving performance issues (Image by Author) - Distributed tracing — ideaMy articles are free for everyone to read! If you don’t have a Medium subscription, feel free to explore the full article directly on my blog: https://blog.bytedoodle.com/distributed-tracing-a-powerful-approach-to-debugging-complex-systems/ M odern applications are increasingly built using microservices, where hundreds of…

Insights from a synthetic control group 16 min read · 10 hours ago Photo by Maria Ziegler on UnsplashReal estate is a bedrock of modern economies, serving as both a tangible asset and an essential component of wealth accumulation for individuals and investment portfolios. Real estate price fluctuations have far-reaching implications, influencing…

How Meta-CoT enhances system 2 reasoning for complex AI challenges Image created by the author using Generative AI (Flux-pro)What makes a language model smart? Is it predicting the next word in a sentence ‒ or handling tough reasoning tasks that challenge even bright humans? Today’s Large Language Models (LLMs) create smooth text plus solve simple…

A basic understanding of NeRF’s workings through visual representations Who should read this article? This article aims to provide a basic beginner level understanding of NeRF’s workings through visual representations. While various blogs offer detailed explanations of NeRF, these are often geared toward readers with a strong technical background in volume rendering and 3D graphics.…

Data Science Explore the power of regex and save time in data analysis Image created by Author using Meta AIData is rarely clean and never in the required structure!! Whether you are starting with data science or are an experienced professional — You won’t deny the above statement! In a data analyst’s career extracting actionable…

Polynomial Fit in Python with NumPy Ball Tracking and Trajectory PredictionIn a previous project I visualized the trajectory of a ball that I threw vertically into the air with a real-time position, velocity and acceleration plot. Extending upon this project, I wanted to calculate and visualize a trajectory prediction based on a simple physics model.…