All articles

Jun 8, 2026

How AI Agents Reshape Knowledge Work

Computer raises task autonomy, lowers cost, and widens the scope of work users take on.

Jun 1, 2026

Rethinking Search as Code Generation

Evolving search from monolithic services to programmable primitives for the era of agent harnesses.

May 20, 2026

Improving Unigram Tokenizer CPU Performance

We reimplemented our Unigram tokenizer from scratch as a focused performance project.

May 14, 2026

Query-Aware Context Compression for Better Snippets

Improving the quality-efficiency frontier of model context through query-aware context compression models.

research

May 12, 2026

Hosting Qwen on Blackwell

May 6, 2026

CuTeDSL at Perplexity

research

May 1, 2026

Designing, Refining, and Maintaining Agent Skills at Perplexity

research

Apr 22, 2026

Advancing Search-Augmented Language Models

Feb 26, 2026

pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval

Today we are releasing pplx-embed-v1 and pplx-embed-context-v1, two state-of-the-art text embedding models built for real-world, web-scale retrieval.

Load more

Jun 8, 2026

How AI Agents Reshape Knowledge Work

Computer raises task autonomy, lowers cost, and widens the scope of work users take on.

Jun 1, 2026

Rethinking Search as Code Generation

Evolving search from monolithic services to programmable primitives for the era of agent harnesses.

May 20, 2026

Improving Unigram Tokenizer CPU Performance

We reimplemented our Unigram tokenizer from scratch as a focused performance project.

May 14, 2026

Query-Aware Context Compression for Better Snippets

Improving the quality-efficiency frontier of model context through query-aware context compression models.

research

May 12, 2026

Hosting Qwen on Blackwell

May 6, 2026

CuTeDSL at Perplexity

research

May 1, 2026

Designing, Refining, and Maintaining Agent Skills at Perplexity

research

Apr 22, 2026

Advancing Search-Augmented Language Models

Load more

Jun 8, 2026

How AI Agents Reshape Knowledge Work

Computer raises task autonomy, lowers cost, and widens the scope of work users take on.

Jun 1, 2026

Rethinking Search as Code Generation

Evolving search from monolithic services to programmable primitives for the era of agent harnesses.

May 20, 2026

Improving Unigram Tokenizer CPU Performance

We reimplemented our Unigram tokenizer from scratch as a focused performance project.

May 14, 2026

Query-Aware Context Compression for Better Snippets

Improving the quality-efficiency frontier of model context through query-aware context compression models.

research

May 12, 2026

Hosting Qwen on Blackwell

May 6, 2026

CuTeDSL at Perplexity

Load more