HyperAccel Tech Blog

Welcome to HyperAccel Tech Blog
Kubernetes logo surrounded by various hardware device icons

Building an In-House Dev Environment on Kubernetes Part 3: Kubernetes Device Plugin for LPU

Hello! I’m Younghoon Jun, a DevOps Engineer on the ML team at HyperAccel. This post is the third installment of the Building an In-House Dev Environment on Kubernetes series! In Part 1, we covered the background, overall design, and direction of building a Kubernetes-based development environment. Part 2 introduced the strategy and process for building an ARC-based CI/CD infrastructure to overcome the structural limitations of self-hosted runners. In this third article, we will discuss the Device Plugin required for utilizing custom resources on Kubernetes. ...

March 27, 2026 · 19 min · 3851 words
AMD Instinct MI300X launch event

AITER Analysis: How AMD Doubled ROCm Inference Performance

An analysis of AITER (AI Tensor Engine for ROCm), which boosts inference performance on AMD GPUs.

March 17, 2026 · 9 min · 1862 words
Transformer Architecture

Transformer World: A Deep Dive into the Building Blocks of LLMs

A hands-on walkthrough of Transformer-based LLM internals — from each module’s role to key optimization techniques.

March 5, 2026 · 15 min · 3010 words
Pallas Programming Model

Know Your Enemy, Know Yourself, Supplement: Pallas Programming Model

Learn about the Pallas programming model, which enables writing custom kernels on TPUs.

February 25, 2026 · 15 min · 3193 words
ICMS and Bluefield-4 DPU

Know Your Enemy, Know Yourself, Part 4: Memory Capacity Bottleneck and NVIDIA ICMS

We explore the technical principles behind NVIDIA’s ICMS — a new storage tier designed to solve the KV cache capacity bottleneck in LLMs — and the Bluefield-4 DPU that manages it.

February 24, 2026 · 12 min · 2456 words
Coding with AI

From Copy-Paste to Autonomous Agents: A Subtle Introduction

Hello, my name is Taeseo Um, and I’m currently on the ML team at HyperAccel. There is a running joke among developers that our job has always been about googling the issue, clicking the first link that pops up, and then copying from Stack Overflow. In some ways, the arrival of AI coding tools didn’t change what we do; it just made us dramatically better at it. Over the past few months, our team has ridden almost every wave of AI-assisted development, from the earliest interactions to the autonomous agents we rely on today. Here’s what that journey looked like, what we learned, and where we think it’s all heading. ...

February 19, 2026 · 8 min · 1689 words
ARC CI/CD Infrastructure

Building a Dev Environment with Kubernetes Part 2: CI/CD Infrastructure Modernization with ARC

The complete journey of redesigning HyperAccel’s CI/CD infrastructure around Actions Runner Controller (ARC), including dual Vault JWT/Kubernetes Auth integration and pipeline observability through a custom-built, Go-based Prometheus exporter.

February 12, 2026 · 15 min · 3076 words
Kubernetes Logo

Building an Internal Development Environment with Kubernetes Chapter 1: Why Kubernetes?

Hello! I’m Younghoon Jun, a DevOps Engineer on the ML team at HyperAccel. For those of you reading this, what kind of environment do you develop in? I imagine you’re working in various environments: local setups, SSH server access, cloud services, and more. At HyperAccel, we develop in an environment built on top of a Kubernetes cluster. We spin up Pods based on devcontainers pre-configured with the packages needed for development, then connect to the container to do our work. To make things easier for our internal developers, we’ve created and provided a Devcontainer Portal. ...

February 9, 2026 · 12 min · 2494 words
Groq logo

Know Your Enemy, Know Yourself, Part 3: Groq's LPU (Acquired by NVIDIA for $20B)

We explore the background of Groq and its LPU, the hardware/software design philosophy behind it, and NVIDIA’s intentions in acquiring Groq.

February 3, 2026 · 20 min · 4156 words
Spec-Driven Development

Specs are All You Need: What is Spec-Driven Development (SDD)?

Spec-Driven Development (SDD) is a new development methodology for the AI era, moving beyond prompt engineering to spec engineering.

February 3, 2026 · 17 min · 3445 words