About me

Interests

Data Engineering, the modern data stack, and in the question on how to build a successful data infrastructure

Data Science, NLP, and the effects of Data Science on our society

Software Engineering best practices, such as testing, documentation, readable code and code reviews

Cloud computing

Technologies

Data Science: Python (Scikit-Learn, PyTorch, TensorFlow, Pandas, Numpy), SQL

Data Engineering: dbt, Snowplow, BigQuery, PySpark, Airflow

Web Development: FastAPI, TailwindCSS, htmx, jinja

DevOps: Git, Docker, Github Actions, Terraform (basics)

NLP: Gensim, Spacy, Huggingface

Data Visualization: Looker, Streamlit