Over the last few years, I have assembled knowledge on how to build data processing systems by reading and watching others, trying things that turned out to be mistakes, and occasionally hitting good spots. I believe in sharing, so I will publish pieces on my web site, gradually building a resource site for data engineers. The first piece is a data engineering reading & watching list with the articles, books, and presentations I find most valuable or influential.
Welcome to my blog. Posts will be infrequent, but will have a high signal-to-noise ratio. The primary focus will be the practical technical aspects of scalable batch and real-time processing, complemented by occasional posts on general software engineering and technical productivity.