A practical guide to serving multiple Qwen3 models from a single llama-server instance using model routing, covering embedding, reranking, and chat/vision models.
Tested on Windows with an RTX 3090 (24 GB VRAM), using a llama-server build from the llama.cpp master branch. Last updated: 2025-03-09.
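To make the routing idea concrete, here is a minimal client-side sketch: each task type maps to an endpoint and a model name, and the chosen name goes into the `model` field of the request. The endpoint paths follow llama-server's OpenAI-compatible API; the model aliases (`qwen3-embedding`, `qwen3-reranker`, `qwen3-vl-chat`) are hypothetical placeholders, not names the server defines.

```python
# Sketch of client-side model selection for a multi-model llama-server
# setup. Model aliases are hypothetical; substitute whatever names your
# server actually registers for each loaded model.

TASK_MODELS = {
    "embedding": "qwen3-embedding",   # hypothetical alias
    "rerank": "qwen3-reranker",       # hypothetical alias
    "chat": "qwen3-vl-chat",          # hypothetical alias
}

ENDPOINTS = {
    "embedding": "/v1/embeddings",
    "rerank": "/v1/rerank",
    "chat": "/v1/chat/completions",
}

def route(task: str) -> tuple[str, str]:
    """Return (endpoint path, model alias) for a given task type."""
    if task not in TASK_MODELS:
        raise ValueError(f"unknown task: {task}")
    return ENDPOINTS[task], TASK_MODELS[task]
```

In use, a client would POST to `http://localhost:8080` + the returned endpoint, with the returned alias in the request body's `model` field, and the server dispatches to the matching loaded model.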
