3x Inference Acceleration on RISC-V CPU! V-SEEK: Accelerating 14B LLM on SOPHON SG2042
Keywords: V-SEEK, LLM Inference Optimization, RISC-V, SOPHON SG2042, llama.cpp, NUMA Optimization V–SEEK: ACCELERATING LLM REASONING ON OPEN-HARDWARE SERVER-CLASS RISC-V PLATFORMS https://arxiv.org/abs/2503.17422 In recent years, the exponential growth of large language models (LLMs) has relied on GPU-based systems. However, CPUs are gradually becoming a flexible and cost-effective alternative, especially for inference (the phase where the model … Read more