Spring is here and the temperatures are rising in the AI landscape! With an unprecedented pace in breakthrough model releases, highlighted by Claude 3 surpassing the year-old GPT-4, the field is blossoming in all sorts of exciting new directions. In this brief blog post, we'll cover some of the latest notable developments from the industry, along with the most trending research papers of the month.
News Articles
Groq’s $20,000 LPU chip breaks AI performance records to rival GPU-led industry
India demands beta AIs secure government permission before going public
Elon Musk Sues OpenAI and Sam Altman for Violating the Company’s Principles
And the response: OpenAI says Elon Musk wanted 'absolute control' of the company
Humanoid AI-driven robotics startup Figure raises $675M at $2.6B valuation
Klarna’s AI Assistant Is Doing The Job Of 700 Workers, Company Says
Model Releases
Meta: V-JEPA
Google DeepMind: Gemini 1.5, Gemma
OpenAI: Sora
Cohere For AI: Aya
Stability AI: Stable Diffusion 3
Mistral: Mistral Large
Anthropic: Claude 3
Trending AI papers for March 2024
Generative Representational Instruction Tuning - N. Muennighoff et al. (Contextual AI) - 15 Feb. 2024
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval - P. Sarthi et al. (Stanford University) - 31 Jan. 2024
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models - Y. Shao et al. (Stanford University) - 21 Feb. 2024
In Search of Needles in an 11M Haystack: Recurrent Memory Finds What LLMs Miss - Y. Kuratov et al. (AIRI Institute) - 16 Feb. 2024
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models - S. De et al. (Google DeepMind) - 29 Feb. 2024
Chain-of-Thought Reasoning Without Prompting - Xuezhi Wang & Denny Zhou (Google DeepMind) - 15 Feb. 2024
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement - Z. Wu et al. (Shanghai AI Laboratory & ECNU) - 12 Feb. 2024
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits - S. Ma et al. (Microsoft Research) - 27 Feb. 2024
Genie: Generative Interactive Environments - J. Bruce et al. (Google DeepMind) - 23 Feb. 2024
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions - L. Tian et al. (Alibaba Cloud Intelligence) - 27 Feb. 2024
And some runner-ups that didn't make our top-10 cut, but are worth checking out:
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping - L. Lehnert et al. (Meta) - 21 Feb. 2024
ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval - S. Yoon et al. (Seoul National University) - 24 Feb. 2024
Approaching Human-Level Forecasting with Language Models - D. Halawi et al. (UC Berkeley) - 28 Feb. 2024
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT - J. Saad-Falcon et al. (Stanford University) - 12 Feb. 2024
You can find an annotated collection of these papers, and more, in Zeta Alpha, which allows you to effortlessly discover relevant literature and dive deeper into any of the topics close to your interests!
Finally, the full-length recording of our March 2024 Trends in AI webinar is available on YouTube.
For a short video overview describing the trending papers, check this out:
To join us live for the next installment of the Trends in AI webinar, make sure to sign up here.