Welcome
My name is Siarhei Berdachuk.
I'm a seasoned IT professional with over 30 years of experience in software engineering, team leadership, and cutting-edge solution design. I specialize in Java development, AI integration with Spring AI, and high-performance backend systems.
This blog is where I share insights on clean code, test-driven development, scalable architectures, and practical applications of AI in enterprise environments.
Latest Articles

10-Jun-2026
Running Gemma 4 12B Locally with Speculative Decoding on Ubuntu
A practical guide to running Gemma 4 12B locally on Ubuntu using llama.cpp, Hugging Face automatic model loading, and speculative decoding with MTP.

15-May-2026
Running Gemma 4 31B Faster at Home with Speculative Decoding
A practical guide to running Gemma 4 31B on a dual-GPU Ubuntu box using llama.cpp TurboQuant, Gemma 4’s MTP assistant head, and speculative decoding to nearly double token throughput.

11-Apr-2026
How We Made RAG Indexing Faster With an Adaptive Embedding Endpoint Pool
A simple explanation of how to speed up embeddings by routing work across fast and slow local AI endpoints without letting one slow batch block the whole indexing pipeline.

10-Apr-2026
Why Vector Search Is Harder Than It Looks (And Why It Matters)
A simple, practical introduction to embeddings, vector indexes, and real-world semantic search