Archives for Florian Wilhelm's blog

Unlocking the Power of Integer Programming: A Beginner’s Guide to Conference Scheduling

WALD: A Modern & Sustainable Analytics Stack

The Programme of the PyConDE / PyData 2023 in Berlin

Forget about AI and do Mathematical Modelling instead!

Effective and Consistent Configuration via YAML & CLI with Hydra

Matrix Factorization for Collaborative Filtering Is Just Solving an Adjoint Latent Dirichlet Allocation Model After All

Handling Anaconda without getting Constricted

Using Google BigQuery with Programmatic SQL

A Practical Comparison of Polars and Pandas

Finally! Bayesian Hierarchical Modelling at Scale

Honey, I shrunk the target variable

Are you sure about that?! Uncertainty Quantification in AI

More Efficient UD(A)Fs with PySpark

Querying NoSQL with Deep Learning to Answer Natural Language Questions

Working efficiently with JupyterLab Notebooks

Performance evaluation of GANs in a semi-supervised OCR use case

Multiplicative LSTM for sequence-based Recommenders

Bridging the Gap: from Data Science to Production

How mobile.de brings Data Science to Production for a Personalized Web Experience

Managing isolated Environments with PySpark

Data Science in Production: Packaging, Versioning and Continuous Integration

Efficient UD(A)Fs with PySpark

Declarative Thinking and Programming

“Which car fits my life?” - mobile.de’s approach to recommendations

Causal Inference and Propensity Score Methods

Hive UDFs and UDAFs with Python

Handling GPS Data with Python

Leveraging the Value of Big Data with Automated Decision Making

Interactively visualizing distributions in a Jupyter notebook with Bokeh

Explaining the Idea behind ARD and Bayesian Interpolation

Introduction to the Python Data Science Stack

How to write a friendly reminder bot

Extending Scikit-Learn with your own regressor

How to setup a new Python project

Podcast about Predictive Analytics

Handling Big Data with Python