Business

Microsoft launches robust AI 'small language model' for researchers

Sun, Dec 17 2023 12:39:20 PM

New Delhi, Dec 17 (IANS): Microsoft has released its newest compact “small language model”, called Phi-2, which performs on par with or better than certain larger open-source Llama 2 models with fewer than 13 billion parameters.

Over the past few months, the Machine Learning Foundations team at Microsoft Research has released a suite of small language models (SLMs) called “Phi” that achieve remarkable performance on a variety of benchmarks.

The first model, the 1.3-billion-parameter Phi-1, achieved state-of-the-art performance on Python coding among existing SLMs (specifically on the HumanEval and MBPP benchmarks).

“We are now releasing Phi-2, a 2.7 billion-parameter language model that demonstrates outstanding reasoning and language understanding capabilities, showcasing state-of-the-art performance among base language models with less than 13 billion parameters,” the company said in an update.

Phi-2 is an ideal playground for researchers, including for exploration around mechanistic interpretability, safety improvements, or fine-tuning experimentation on a variety of tasks.

“We have made Phi-2 available in the Azure AI Studio model catalog to foster research and development on language models,” said Microsoft.

The massive increase in the size of language models to hundreds of billions of parameters has unlocked a host of emerging capabilities that have redefined the landscape of natural language processing.

However, a question remains whether such emergent abilities can be achieved at a smaller scale using strategic choices for training, e.g., data selection.

“Our line of work with the Phi models aims to answer this question by training SLMs that achieve performance on par with models of much higher scale (yet still far from the frontier models),” said Microsoft.

The company has also performed extensive testing on commonly used prompts from the research community.

“We observed a behaviour in accordance with the expectation we had given the benchmark results,” said the tech giant.
