
Meta launches the Llama 3 artificial intelligence model, offering a 70B-parameter version with greatly improved performance


Meta's AI research division today launched the Llama 3 model. Trained on 15T (trillion) tokens, the model is offered in both pretrained and instruction-fine-tuned variants, comes in 8B and 70B parameter versions, and can be used in a wide range of environments.

Compared with Llama 2, the new version offers new capabilities and improved reasoning, significantly reduces false refusal rates, supports multiple languages and modalities, provides longer context, and improves overall performance on core tasks such as reasoning and coding.

In several benchmark tests, Llama 3 outperforms Mistral 7B, Mixtral 8x22B, and Google Gemini Pro 1.0, making it the best-performing open AI model currently available.


To maximize Llama 3's performance in chat scenarios, Meta also revamped its instruction fine-tuning approach, combining supervised fine-tuning, rejection sampling, proximal policy optimization (PPO), and direct preference optimization (DPO); the PPO and DPO stages in particular significantly improve Llama 3's performance on reasoning and coding.
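
To make the direct preference optimization step above concrete, here is a minimal sketch of a DPO-style loss in PyTorch. The article does not describe Meta's actual training code, so the function, tensor names, and the beta value are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """DPO-style loss over a batch of preference pairs.

    Each argument is a tensor of per-sequence log-probabilities
    (summed over tokens) for the chosen or rejected response,
    under either the policy being trained or a frozen reference model.
    """
    # Log-ratio of policy vs. reference for each response
    chosen_logratio = policy_chosen_logps - ref_chosen_logps
    rejected_logratio = policy_rejected_logps - ref_rejected_logps

    # DPO pushes the chosen response's log-ratio above the rejected one's
    logits = beta * (chosen_logratio - rejected_logratio)
    return -F.logsigmoid(logits).mean()
```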

For example, Meta notes that when a user asks the model a difficult reasoning question, the model will sometimes produce the correct reasoning trajectory: it knows how to generate the correct answer, but not how to select it. Training on preference rankings teaches the model how to make that selection.
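
As a hedged illustration of such a preference pair, the example below marks the reasoning trajectory that reaches the correct answer as "chosen" and a flawed one as "rejected"; the prompt, responses, and field names are invented for illustration and are not from Meta's training data.

```python
# Hypothetical preference pair for ranking training (illustrative data only).
preference_pair = {
    "prompt": "A train travels 60 km in 45 minutes. What is its average speed in km/h?",
    # Trajectory that reaches the correct answer -> preferred
    "chosen": "45 minutes is 0.75 hours, so speed = 60 / 0.75 = 80 km/h.",
    # Trajectory with a unit-conversion slip -> dispreferred
    "rejected": "45 minutes is 0.45 hours, so speed = 60 / 0.45 is about 133 km/h.",
}

# During training, both responses are scored by the policy and the reference
# model, and a loss such as the DPO sketch above nudges the policy toward
# assigning higher probability to the chosen trajectory.
```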

On the safety side, Meta has released updated versions of Llama Guard 2 and CyberSecEval 2, and also introduces Code Shield, an inference-time guardrail that filters insecure code generated by large language models, improving Llama 3's overall safety.
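
The article does not detail Code Shield's interface, so the following is only a hypothetical sketch of the general idea of an inference-time guard: generated code is scanned before it is returned, and flagged output is withheld. The patterns and function names are invented for illustration.

```python
import re

# Hypothetical insecure-code patterns; a real guard such as Code Shield
# relies on far more sophisticated analysis than simple regexes.
INSECURE_PATTERNS = [
    r"\beval\s*\(",          # arbitrary code execution
    r"\bos\.system\s*\(",    # shell command injection risk
    r"verify\s*=\s*False",   # disabled TLS certificate verification
]

def scan_for_insecure_patterns(code: str) -> list[str]:
    """Return the patterns found in the generated code."""
    return [p for p in INSECURE_PATTERNS if re.search(p, code)]

def guarded_generate(generate_fn, prompt: str) -> str:
    """Wrap a model's generate function with an inference-time code filter."""
    code = generate_fn(prompt)
    if scan_for_insecure_patterns(code):
        # Withhold the response instead of returning potentially unsafe code.
        return "Generated code was withheld: potential security issues detected."
    return code
```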

The Llama 3 models are now available on major cloud computing platforms, including Amazon AWS and Google Cloud, and developers can also download the models for their own deployments.
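
For developers who download the weights, one common route is the Hugging Face transformers library. The sketch below assumes the `meta-llama/Meta-Llama-3-8B-Instruct` repository name and that access to the gated weights has already been granted; it is an illustrative example, not part of the original announcement.

```python
# Minimal sketch of running the 8B Instruct model with Hugging Face
# transformers (assumes gated model access is approved and enough
# GPU memory is available; device_map="auto" requires accelerate).
import torch
from transformers import pipeline

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumed repo name

generator = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a concise assistant."},
    {"role": "user", "content": "Summarize what Llama 3 is in one sentence."},
]

output = generator(messages, max_new_tokens=128)
print(output[0]["generated_text"][-1]["content"])
```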

Following the release of Llama 3, Meta is already training the next generation of Llama models, the largest of which has more than 400B parameters and is still in training. Meta hopes to roll out a multimodal version in the coming months and to continue extending context length support.

