Meta Unveils Biggest Llama 3 AI Model, Claiming Language And Math Gains

The model is set to be free, challenging the subscription based ChatGPT-4

New York:

Meta Platforms released the biggest version of its mostly free Llama 3 artificial intelligence models on Tuesday, boasting multilingual skills and general performance metrics that nip at the heels of paid models from rivals like OpenAI.

The new Llama 3 model can converse in eight languages, write higher-quality computer code and solve more complex math problems than previous versions, the Facebook parent company said in blog posts and a research paper announcing the release.

With 405 billion parameters, or variables that the algorithm takes into account to generate responses to user queries, it dwarfs the previous version released last year though is still smaller than leading models offered by competitors.

OpenAI’s GPT-4 model, by contrast, is reported to have one trillion parameters and Amazon is preparing a model with 2 trillion parameters.

Promoting Llama 3 across multiple channels, Chief Executive Mark Zuckerberg said he expected future Llama models would overtake proprietary competitors by next year. The Meta AI chatbot powered by those models was on track to become the most popular AI assistant by the end of this year, with hundreds of millions of people using it already, he said.

The release comes as tech companies are racing to show that their growing portfolios of resource-hungry large language models can deliver significant enough gains in known problem areas like advanced reasoning to justify the gargantuan sums that have been invested in them.

Meta’s own top AI scientist has said he believes such models will hit up against limits on reasoning and that other types of AI systems will be needed to produce breakthroughs.

In addition to its flagship 405 billion parameter model, Meta is also releasing updated versions of its lighter-weight 8 billion and 70 billion parameter Llama 3 models initially introduced in the spring, the company said.

All three new models are multilingual and can handle larger user requests via an expanded “context window,” which Meta’s head of generative AI, Ahmad Al-Dahle, said would improve the experience of generating computer code in particular.

“That was the number one feedback we got from the community,” Al-Dahle told Reuters in an interview, noting that bigger context windows give the models something akin to a longer memory that aids in processing multi-step requests.

Separately, Al-Dahle said his team had been able to improve the Llama 3 model’s performance on tasks such as solving math problems by using AI to generate some of the data on which they were trained.

Meta releases its Llama models largely free-of-charge for use by developers, a strategy Zuckerberg says will pay off in the form of innovative products, less dependence on would-be competitors and greater engagement on the company’s core social networks. Some investors have raised their eyebrows at the costs entailed, however.

The company also stands to benefit if developers opt to use its free models over paid ones, which would undercut the business models of its rivals. With its announcement, Meta touted gains on key math and knowledge tests that may make that prospect more appealing.

Although measuring progress on AI development is notoriously difficult, test results provided by Meta appeared to suggest that its largest Llama 3 model was nearly matching and, in some cases, besting Anthropic’s Claude 3.5 Sonnet and OpenAI’s GPT-4o, which are widely regarded as the two most powerful frontier models on the market.

On the MATH benchmark of competition level math word problems, for example, Meta’s model posted a score of 73.8, compared to GPT-4o’s 76.6 and Claude 3.5 Sonnet’s 71.1.

The model scored 88.6 on MMLU, a benchmark that covers dozens of subjects across math, science and the humanities, while GPT-4o scored 88.7 and Claude 3.5 Sonnet scored 88.3.

In their paper, Meta researchers also teased upcoming “multimodal” versions of the models due out later this year that layer image, video and speech capabilities on top of the core Llama 3 text model.

Early experiments indicate those models can perform “competitively” with other multimodal models such as Google’s Gemini 1.5 and Anthropic’s Claude 3.5 Sonnet, they said.

(Except for the headline, this story has not been edited by NDTV staff and is published from a syndicated feed.)

Source link

St. John’s vs. New Mexico prediction: College basketball odds, picks

O.C. firefighter who feared he was paralyzed in crash walks out of rehab

Bullet strikes Southwest Airlines plane at Dallas Love Field Airport : NPR

Dem Rep. torches Harris campaign for relying on celebrity endorsements: ‘No one cares’

Refined carbs and red meat driving global rise in type 2 diabetes, study says

Texas Governor removes over 1 million from voter roll

University of Michigan’s Student Government Votes To Oust Pro-Palestinian President for Inciting Violence

Biden Backs Down on Israel Arms Ultimatum

Democrat Ruben Gallego wins Arizona U.S. Senate race, defeating Republican Kari Lake

St. John’s vs. New Mexico prediction: College basketball odds, picks

O.C. firefighter who feared he was paralyzed in crash walks out of rehab

Bullet strikes Southwest Airlines plane at Dallas Love Field Airport : NPR

Dem Rep. torches Harris campaign for relying on celebrity endorsements: ‘No one cares’

Refined carbs and red meat driving global rise in type 2 diabetes, study says

Texas Governor removes over 1 million from voter roll

University of Michigan’s Student Government Votes To Oust Pro-Palestinian President for Inciting Violence

Biden Backs Down on Israel Arms Ultimatum

Democrat Ruben Gallego wins Arizona U.S. Senate race, defeating Republican Kari Lake

Meta Unveils Biggest Llama 3 AI Model, Claiming Language And Math Gains

An update on the 22 July 2024 in Gofa Zone, Ethiopia landslides

What is the optimal amount of exercise and how much is too much?

Related News

Russia launches massive attack targeting energy infrastructure in Ukraine

Airport chaos as gun fired and bullet hits plane and closes runway | World | News

The X exodus – could Bluesky spike spark end of Elon Musk’s social media platform? | Science, Climate & Tech News

Terrifying moment horror turbulence sends screaming passengers flying out of seats and forces O2 masks to be deployed

What is the optimal amount of exercise and how much is too much?

Discussion about this post

Subscribe To Our Newsletters

Customer Support

Subscribe To Our Newsletters

Categories

Recent News

Russia launches massive attack targeting energy infrastructure in Ukraine

Jelly Roll’s 120-Pound Weight Loss Transformation

First known case of rare mpox strain confirmed in United States

The Justice System Still Has a Chance to Sentence Trump

Welcome Back!

Retrieve your password

Add New Playlist