Good technology blogs: a reading list for the holidays

Alexey Milovidov
Dec 29, 2025 - 8 minutes read

I enjoy reading blogs about performance optimizations, data structures and algorithms, database development, compilers, operating systems, programming languages, computer science and mathematics, hardware and networks, web development, computer graphics, and AI. I'll share some of my favorites with you, to help you survive the holidays.

Daniel Lemire #

Daniel is the author of many high-performance libraries and algorithms. His articles cover details of low-level performance optimizations, and he often finds opportunities to improve fundamental building blocks and computational kernels, used in every program, like parsing IP addresses, validating UTF-8, compressing sequences of numbers, converting numbers to strings, etc. His libraries and algorithms are adopted in ClickHouse too, for example, simdjson and Roaring bitmap. We have also contributed numerous bug fixes back to these libraries.

Ash Vardanyan #

Ash is one of the most visible enthusiasts of performance optimizations and hardware. He is the author of USearch (a library for vector search with HNSW) and StringZilla (a collection of computational kernels for string processing), and both of them are used in ClickHouse.

Wojciech Muła #

Highly optimized computational kernels with and without SIMD instructions. For some reason, this blog does not use HTTPS, but that does not diminish its value.

Daniel Kutenin #

Daniel is an expert in performance optimizations, algorithms, and compilers. He is also a ClickHouse contributor, where he has implemented many string algorithms, such as an adaptation of Volnitsky's substring search for case-insensitive strings and for searching multiple strings at once (better than Aho-Corasick on a small number of strings), integrated Hyperscan (now Vectorscan) for multi and fuzzy search, and added string distances. Most importantly, he is the author of Miniselect which is a library for partial sorting, also integrated into ClickHouse.

Fangrui Song (MaskRay) #

This is a blog about compilers, linkers, and executable binary formats. The amount of information in this blog will make you worry about yourself.

Yann Collet #

Yann is the author of lz4 and zstd, one of the most important libraries for databases. I integrated LZ4 in ClickHouse in 2012, and it was fantastic! I remember how I was amazed and eager to see his zhuff algorithm open-sourced, but he did even more by releasing zstd in 2015, which I immediately integrated in ClickHouse, replacing quicklz. We even found a few bugs in the first 0.x versions of zstd :) I spent a lot of time trying to understand the mechanics of LZ4, and ended up optimizing it just a little.

Yann now works at Facebook, and his personal blog stopped being updated. For more resources about compression, you can check the data compression forum and read Data Compression Explained from Matt Mahoney.

Dan Luu #

Dan has many great articles about CPU and performance profiling.

Brendan Gregg #

If you are interested in performance profiling, introspection, instrumentation, and tracing, read this blog.

Russ Cox #

Russ is the author of the re2 library, he writes about algorithms amongst other topics. Re2 was a revolution for us - I still remember how colleagues tried to convince me that giving end users access to queries with arbitrary regular expressions would end up with DoS, but I knew it was going to be alright because I use re2! This was proven by practice - in 15 years, there was not a single case to lose trust in re2. There are faster libraries, like vectorscan/hyperscan and more versatile libraries, but re2 remains unbeaten. This article about uninitialized memory is especially interesting.

Bruce Dawson #

Bruce has many articles about floating point numbers.

Probably Dance #

This blog by Malte Skarupke is amazing. Scroll down to find the tech articles. He didn't write the fastest hash table.

Alisa Sireneva #

Amazing blog about low-level optimizations, algorithms, and compilers. Don't worry, you can easily skip the articles about Rust if you want.

Kamila Szewczyk #

Good articles about data compression, such as this.

Salvatore Sanfilippo #

A blog from the author of Redis. He writes rarely, and not only about AI. I really liked his smaz compression library and linenoise (an alternative to readline). He writes about experiments and observations from a first person perspective, and he is able to rethink some ideas and approaches without influence. It's worth noting that Redis lacks multithreading, it does not have on-disk data structures, and it does not implement distributed consensus. Finally, its development speed is very low, and the author uses a very conservative approach to software development. That's why there are many Redis clones, but none of them are comparable in terms of its reach.

Mitchell Hashimoto #

This blog isn't about databases. It's great to read about the first-person perspective of the founder of Hashicorp, as well as the development of a terminal emulator in Zig.

Fabien Sanglard #

This blog is also not about databases, but there is a lot of good stuff in it, such as the article about pseudorandom permutations.

Ben E. C. Boyter #

Ben is the author of scc - a tool for counting lines of code. I especially liked his article about analyzing GitHub.

Nikita Lapkov #

Nikita has many good articles about databases. He is also a ClickHouse contributor, where he implemented the first version of always-on profiler.

Simon Willison #

Simon writes about AI. Even if you already get nauseous from this topic, don't run away, because Simon writes about first-person, hands-on experience with new models and tools. The articles are concise and unpretentious, which makes them pleasant to read.

Andrej Karpathy #

This is a great blog about AI, with articles like this you should probably have read ten years ago. Today he is active on YouTube.

Jay Mody #

This is such a great blog about AI! Only five articles, though.

Kyle Kingsbury #

He tests the consistency of distributed databases, and there is hardly any database that survives his testing. Kyle is the author of the Jepsen framework, which is also used in ClickHouse, as well as in any self-respecting distributed DBMS.

Database Architects #

A blog from developers of academic databases from TUM and CWI. The covered topics are often very similar to the development of ClickHouse.

DBMS Musings by Daniel Abadi #

A similar blog about databases, mostly column-oriented. Not updated since 2019.

Marc Brooker #

A blog about databases and distributed systems.

Mark Callaghan #

The last articles are all about how he compiled a certain version of MySQL or PostgreSQL and compared performance. It's very boring. I like this blog.

Raymond Chen #

A lot of interesting curiosities from the history of the development of Windows.

Bartosz Ciechanowski #

This blog isn't about databases or AI, and the articles appear very rarely. This is the best blog I've ever seen.

Justine Tunney #

She is the author of Cosmopolitan Libc, and the blog is great to read if you are interested in binary formats, linkers and loaders, machine instructions, operating systems, and hardware. Note that ClickHouse does not use Cosmopolitan Libc, as it is still hard to apply for large C++ applications. We have a build with Musl-libc, and I also hope that LLVM libc will eventually be suitable for ClickHouse.

Alex Kladov #

Numerous articles discuss low-level code and performance. Sometimes about Rust and Zig, but should you worry?

LLVM blog #

If you are interested in compilers.

LWN #

If you are interested in operating systems.

v8 #

If you are interested in web browsers.

Julia Evans #

A collection of brief notes on things worth knowing.

Rachel #

Many small observations about tech.

Charles Bloom #

Many good articles about data compression. From this blog, I learned the term "cache table", and cache tables are widely used in ClickHouse, for cache dictionaries, for data compression, and for small caches.

Peter Kankowski #

Low-level performance optimization tricks. For some reason, it is not updated anymore, but some of the old articles are golden.

Jeff Preshing #

Also not updated, but there are many good articles, like this one about pseudorandom permutations.

Marek #

It's been a long time since the last update, but there are many good articles, e.g., about bitslicing.

Thoughts #

While going through my favorite blogs, I've noticed that some of them have sadly stopped updating. Some blogs even disappeared, like a blog of Yury Lifshits about string and nearest neighbor algorithms or a blog of Eugene Kirpichev about functional data structures, such as incremental regular expressions. Sometimes there are rumors that the authors are still alive and continue to write somewhere on darknetFacebook, but about therapy, philosophy, or parenting. What I really want is for them to continue writing great articles about technology!

Share this post

Subscribe to our newsletter

Stay informed on feature releases, product roadmap, support, and cloud offerings!
Loading form...