All articles

Jun 8, 2026
How AI Agents Reshape Knowledge Work
Computer raises task autonomy, lowers cost, and widens the scope of work users take on.

Jun 1, 2026
Rethinking Search as Code Generation
Evolving search from monolithic services to programmable primitives for the era of agent harnesses.

May 20, 2026
Improving Unigram Tokenizer CPU Performance
We reimplemented our Unigram tokenizer from scratch as a focused performance project.

May 14, 2026
Query-Aware Context Compression for Better Snippets
Improving the quality-efficiency frontier of model context through query-aware context compression models.

research
May 12, 2026
Hosting Qwen on Blackwell

May 6, 2026
CuTeDSL at Perplexity

research
May 1, 2026
Designing, Refining, and Maintaining Agent Skills at Perplexity

research
Apr 22, 2026
Advancing Search-Augmented Language Models

Feb 26, 2026
pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval
Today we are releasing pplx-embed-v1 and pplx-embed-context-v1, two state-of-the-art text embedding models built for real-world, web-scale retrieval.
Load more

Jun 8, 2026
How AI Agents Reshape Knowledge Work
Computer raises task autonomy, lowers cost, and widens the scope of work users take on.

Jun 1, 2026
Rethinking Search as Code Generation
Evolving search from monolithic services to programmable primitives for the era of agent harnesses.

May 20, 2026
Improving Unigram Tokenizer CPU Performance
We reimplemented our Unigram tokenizer from scratch as a focused performance project.

May 14, 2026
Query-Aware Context Compression for Better Snippets
Improving the quality-efficiency frontier of model context through query-aware context compression models.

research
May 12, 2026
Hosting Qwen on Blackwell

May 6, 2026
CuTeDSL at Perplexity

research
May 1, 2026
Designing, Refining, and Maintaining Agent Skills at Perplexity

research
Apr 22, 2026
Advancing Search-Augmented Language Models
Load more

Jun 8, 2026
How AI Agents Reshape Knowledge Work
Computer raises task autonomy, lowers cost, and widens the scope of work users take on.

Jun 1, 2026
Rethinking Search as Code Generation
Evolving search from monolithic services to programmable primitives for the era of agent harnesses.

May 20, 2026
Improving Unigram Tokenizer CPU Performance
We reimplemented our Unigram tokenizer from scratch as a focused performance project.

May 14, 2026
Query-Aware Context Compression for Better Snippets
Improving the quality-efficiency frontier of model context through query-aware context compression models.

research
May 12, 2026
Hosting Qwen on Blackwell

May 6, 2026
CuTeDSL at Perplexity
Load more