Posts

These posts describe everything I put out here that is not yet published in any other form. Usually, the content is about projects that I do in my spare time or just implementations of techniques that I find interesting.

Some of the articles are unfinished and will be updated later.

On the Implementation of AI Ethics, 24 Feb. 2025 (posts)

This is a (german) term paper that I wrote in 2019 (in a pre-LLM era) for a seminar on the philosophical aspects of AI. It discusses general strategies for implementing ethical behavior in AI systems at the example of autonomous vehicles. While somewhat outdated, it still constitutes a reasonable introduction to the topic. Einleitung § Lange Zeit galten menschliche Individuen und Gesellschaften als die einzigen intelligenten Entscheidungsträger. Durch die Fortschritte in der Informatik, …

Categories: Philosophy

2893 Words, Tagged with: Ethics · Autonomous Vehicles

Thumbnail for On the Implementation of AI Ethics

Home Server Setup 2025, 26 Jan. 2025 (posts)

In this post, I want to present my current home server setup, including the hardware, the virtualized infrastructure (Networks, VMs), and the services (Containers) I am running.1 The goal is to give you some inspiration and also to have some more thorough documentation for myself. While writing, I noticed some possible improvements, so there is value in the documentation process itself. This post will be quite long as the infrastructure evolved over a prolonged period. To avoid convoluting it …

Categories: Homeserver

3113 Words, Tagged with: Homeserver · Virtualization

Training a German LLM from Scratch, 14 Nov. 2024 (posts)

This article is not finished and will be updated. The research group I work with has access to a small GPU cluster, which occasionally sits idle. To avoid wasting valuable compute resources (IDLE GPUs essentially burn money through opportunity costs), I decided to train a German GPT-2-style model from scratch, using only German text. Existing German models available on Hugging Face have 137M parameters and a context length of 1024 tokens1, which is quite limited compared to recently released …

Categories: Deep Learning

2794 Words, Tagged with: Deep Learning · Generative Models · LLM

Thumbnail for Training a German LLM from Scratch

Mining the Bundestag, 22 Jan. 2023 (posts)

Did you know that the German parliament publishes protocols for all of its proceedings in PDF format? It is relatively straightforward to download and parse them, so we can easily collect a dataset of transcripts of what seems to be every speech in the Bundestag since the Second World War. My original idea was to mine the speeches for word associations. Some words will be associated with other words based on the intended connotation, and this association might change over time as the …

Categories: Data Mining

1025 Words, Tagged with: Bundestag · Data Mining · Generative Models

Mining tagesschau.de, 26 Nov. 2022 (posts)

I like to read tagesschau.de, so I wrote a script to scrape it in regular intervals. My original goal was to determine which articles stay on the front page the longest, which ones allow commenting (a feature that seems to have been disabled almost entirely since March 2020), and if articles are modified after the initial release (without mentioning this), because I sometimes feel that headlines change. Dataset Creation § Tagesschau provides a JSON API, so fetching all of the articles is …

Categories: Data Mining

1046 Words, Tagged with: Tagesschau · Generative Models · Data Mining

Convolutional Filter Visualization, 27 Jul. 2022 (posts)

Deep Neural Networks are black-boxes: they map some input to some output, and we can make them do this surprisingly well. However, we usually have no idea how this mapping works. Particularly Convolutional Neural Networks (CNNs), which employ “convolutions” as filters, achieved some impressive results (before Vision Transformers came along). Filter Visualization can help us understand what kind of patterns the convolutional filters in CNNs detect. Why would we want to do it? § …

Categories: Deep Learning

472 Words, Tagged with: Deep Learning · Explainability · CNN

Thumbnail for Convolutional Filter Visualization

Explanation-based Anomaly Detection in Deep Neural Networks, 01 Feb. 2020 (posts)

Masters Thesis (PDF). If an AI gives you a weird explanation for its prediction, you should remain septical about the accuracy of the prediction. Sounds reasonable? This was the general idea of my masters thesis, which was originally titled Self-Assessment of Visual Recognition Systems based on Attribution. Today, I would call it Explanation-based Anomaly Detection in Deep Neural Networks. The general idea was to use attribution-based explanation methods to detect anomalies (such as unusual …

Categories: Anomaly Detection

340 Words, Tagged with: Deep Learning · Anomaly Detection · CNN · Explainability