Meta releases Llama 3 language models 

Meta has announced the release of the first two models of its next-generation Llama language model series, Llama 3. The release features pretrained and instruction-fine-tuned language models with 8B and 70B parameters.

To develop Llama 3, Meta says it focused on four key aspects: model architecture, pretraining data, scaling up pretraining, and instruction fine-tuning. The models use a standard decoder-only transformer architecture with a 128K token vocabulary and are pretrained on over 15T tokens collected from publicly available sources.
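To put the pretraining scale in perspective, the quoted figures can be turned into a rough tokens-per-parameter ratio. The sketch below is only back-of-the-envelope arithmetic: it assumes the nominal 8B and 70B sizes are exact parameter counts and treats "over 15T" as a lower bound of 15 trillion tokens. The function and variable names are illustrative and do not come from Meta.

```python
# Figures quoted in the article; "over 15T" is treated as a lower bound.
PRETRAINING_TOKENS = 15e12
# Nominal model sizes (assumption: used as if they were exact parameter counts).
MODEL_PARAMS = {"8B": 8e9, "70B": 70e9}


def tokens_per_parameter(tokens: float, params: float) -> float:
    """Return the number of pretraining tokens per model parameter."""
    return tokens / params


ratios = {
    name: tokens_per_parameter(PRETRAINING_TOKENS, params)
    for name, params in MODEL_PARAMS.items()
}

for name, ratio in ratios.items():
    print(f"Llama 3 {name}: at least {ratio:,.0f} tokens per parameter")
```

Under these assumptions, the smaller model sees far more training tokens per parameter than the larger one, since both are described as pretrained on the same corpus.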

Meta has also updated its trust and safety tools, with new versions in Llama Guard 2 and Cybersec Eval 2, and has introduced Code Shield, an inference-time guardrail for filtering insecure code produced by LLMs.

The Llama 3 models will be available on major platforms, including cloud providers and model API providers like AWS, Databricks, Google Cloud, Hugging Face, Kaggle, IBM WatsonX, Microsoft Azure, Nvidia NIM, and Snowflake, Meta said in a statement. Hardware platforms supporting Llama 3 include AMD, AWS, Dell, Intel, Nvidia, and Qualcomm.

Meta’s largest models, currently in training, have over 400B parameters and are expected to offer new capabilities such as multimodality, multilingual conversation, and longer context windows.

Meta has integrated the Llama 3 models into its AI assistant, Meta AI, which is now available in more countries across its apps, including Facebook, Instagram, WhatsApp, Messenger, and the web. The models can be downloaded from the Llama 3 website, and a Getting Started Guide is available for reference.

[Image courtesy: Meta]

Updated: April 23, 2024
