Been training Grok-2 to improve its general instruction-following over the past few months. Excited to release it to everyone for free. Try our free API with $25 free credit as well!
Ziniu Hu
162 posts
- Did a little bit of humble work: No.1
- Been training the Grok Code Fast 1 model with the incredible team. It's a blazing fast model 🚀 that can solve a broad range of real-world agentic coding tasks. Excited to share it with the world, hope it helps with your work!
  Quoted post: Introducing Grok Code Fast 1, a speedy and economical reasoning model that excels at agentic coding. Now available for free on GitHub Copilot, Cursor, Cline, Kilo Code, Roo Code, opencode, and Windsurf. x.ai/news/grok-code…
- Grok 4 is live 🚀
- Over the past few months I saw some posts saying reasoning models cannot do agent tasks. Try Grok Deepsearch, which combines the best of both worlds, for free 🚀
  Quoted post: This is it: The world’s smartest AI, Grok 3, now available for free (until our servers melt). Try Grok 3 now: x.com/i/grok X Premium+ and SuperGrok users will have increased access to Grok 3, in addition to early access to advanced features like Voice Mode.
- Been working on post-training the grok3-mini reasoning models, combining reasoning and function-calling capabilities for building your agents. Don't forget our $150 free credits to get started with these models: docs.x.ai/docs/data-shar…
  Quoted post: Meet the Grok 3 family, now on our API! Grok 3 Mini outperforms reasoning models at 5x lower cost, redefining cost-efficient intelligence. Grok 3, the world's strongest non-reasoning model, excels in tasks that need real-world knowledge like law, finance, and healthcare.
- Thrilled to receive the KDD Dissertation Award Runner-Up for my PhD work on Neural-Symbolic Reasoning. Sincere thanks to my PhD advisors @YizhouSun and @kaiwei_chang, and my letter supporters @yisongyue and @JHamrick. Thanks to the award committee @kdd_news for this honor.
- Interested in LLM + Tool-Use via Tree-Search? This afternoon at #NeurIPS2023, #215, I'll present "AVIS: Autonomous Visual Information Seeking with Large Language Model Agent" (blog.research.google/2023/08/autono…). Feel free to drop by and chat.
- 🤔 How can a Large Language Model (LLM) agent use diverse tools via Tree Search 🔍? In AVIS, we enable the LLM agent to dynamically traverse a transition graph with self-critique (when one path is not informative, it backtracks to the previous state). This achieves SOTA VQA results.
  Quoted post: Today on the blog, read all about AVIS (Autonomous Visual Information Seeking with Large Language Models), a novel method that iteratively employs a planner and reasoner to achieve state-of-the-art results on visual information seeking tasks → goo.gle/3P2y2mY
- Can LLMs play the hidden-identity board game "Resistance: Avalon"? Check out: arxiv.org/abs/2310.05036 Code: github.com/jonathanmli/Av… In this work, we built a game engine, AvalonBench, which includes several fixed-rule baselines. We found that ChatGPT 3.5 still cannot beat these simple rule-based baselines.
- How to control LLM behavior with LLM-as-a-judge? Check out our paper: "Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller" Website: llm-self-control.github.io Paper: arxiv.org/abs/2406.02721 Code: github.com/HenryCai11/LLM…
- Excited to receive the #SoCalNLP Best Paper Award for our paper "Empowering Language Models with Knowledge Graph Reasoning for Question Answering". Paper link: arxiv.org/abs/2211.08380 Thanks to the organizers and all the great collaborators!
  Quoted post (replying to @UCSB_AI): Our @MegagonLabs Best Paper Award winner was "Empowering Language Models with Knowledge Graph Reasoning for Question Answering" by Ziniu Hu et al. from UCLA! Paper link: arxiv.org/abs/2211.08380 Thank you to award sponsor @MegagonLabs for supporting our event! (4/4)
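
The AVIS post above describes an agent that traverses a transition graph of tools and backtracks when a path turns out to be uninformative. A minimal Python sketch of that general idea follows. It is not the AVIS implementation: the graph `TRANSITIONS`, the tool names, and the `stub_critic` function are all hypothetical stand-ins, and a real system would presumably have an LLM judge actual tool outputs rather than look up a fixed rule.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical transition graph: each state lists the tools allowed next.
TRANSITIONS: Dict[str, List[str]] = {
    "start": ["caption", "object_detect"],
    "caption": ["web_search"],
    "object_detect": ["image_search"],
    "image_search": ["web_search"],
    "web_search": ["answer"],
    "answer": [],
}


def stub_critic(state: str, nxt: str) -> bool:
    """Stand-in for a self-critique step (assumption, not a real LLM call).

    Returns False when moving from `state` to `nxt` is judged uninformative.
    """
    return (state, nxt) not in {("caption", "web_search")}


def tree_search(
    state: str,
    goal: str,
    transitions: Dict[str, List[str]],
    is_informative: Callable[[str, str], bool],
    path: Optional[List[str]] = None,
) -> Optional[List[str]]:
    """Depth-first traversal that backtracks out of uninformative paths."""
    path = [state] if path is None else path
    if state == goal:
        return path
    for nxt in transitions.get(state, []):
        if nxt in path:
            continue  # avoid revisiting a state already on this path
        if not is_informative(state, nxt):
            continue  # critic rejects this step; try the next option
        found = tree_search(nxt, goal, transitions, is_informative, path + [nxt])
        if found is not None:
            return found
    return None  # dead end: the caller backtracks to its previous state


if __name__ == "__main__":
    print(tree_search("start", "answer", TRANSITIONS, stub_critic))
```

In this toy graph the search first tries `caption`, finds its only continuation rejected by the critic, returns to `start`, and then succeeds through `object_detect`.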