OpenAI’s new AI Reinforcement Fine-Tuning could transform how scientists use its models

The second day of OpenAI‘s 12 Days of OpenAI shifted to less spectacular, more enterprise interests compared to the general rollout of the OpenAI o1 model to ChatGPT on day one.

Instead, OpenAI announced plans to release Reinforcement Fine-Tuning (RFT), a way to customize its AI models for developers who want to adapt OpenAI’s algorithms for specific kinds of tasks, especially more complex ones. This release marks a clear shift toward enterprise applications compared to day one’s consumer-focused updates. You can think of RFT as a method for improving how AI models work through their reasoning for responses. Using a dataset and evaluation rubric from a developer lets OpenAI’s platform train their specialized AI without lots of expensive reinforcement from later experiences.

RFT could be a boon for AI tools employed in law and science. OpenAI highlighted in its live stream the CoCounsel AI assistant built with RFT by Thompson Reuters and how RFT helps researchers studying rare genetic diseases at Berkeley Lab. However, the business partnerships aren’t going to make much difference in the short term for average users of ChatGPT or other OpenAI products.

today we are announcing reinforcement finetuning, which makes it really easy to create expert models in specific domains with very little training data.livestream going now: https://t.co/ABHFV8NiKcalpha program starting now, launching publicly in q1December 6, 2024

Enterprise or consumer

If you’re more keen on the consumer side of things, don’t give up just yet. While the enterprise tilt contrasts with day one, it’s easy to imagine OpenAI wanting to have as broad a range of news during the 12 days as possible. There will almost certainly be plenty more consumer news to come. Perhaps alternating days or some other pattern.

Still, at least the ending joke from OpenAI was a little funnier than yesterday. The AI described how self-driving vehicles are popular in San Fransisco, and Santa is keen to make a self-driving sleigh as part of the trend. The problem is that it keeps hitting trees. What’s the problem? He didn’t pine-tune his models. Maybe the image ChatGPT made for TechRadar’s Editor-at-Large Lance Ulanoff will sell the humor better.

ChatGPT visualizing an OpenAI joke told during Day 2 of 12 Days of OpenAI.

(Image credit: ChatGPT)

You might also like…

https://cdn.mos.cms.futurecdn.net/BHjXPtvs6sK9QX4XiCPrHM-1200-80.jpg

Source link
erichs211@gmail.com (Eric Hal Schwartz)

Why hands-on digital skills will define the value of AI

‘I was stuck on the overpass with dump trucks all around me’: A mass Baidu robotaxi outage just caused traffic mayhem in China

‘I was stuck on the overpass with dump trucks all around me’: A mass Baidu robotaxi outage just caused traffic mayhem in China

‘Side effects may include curiosity’: Google’s $3 ChromeOS Flex kit aims to save your old Windows 10 laptop from the scrapheap

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

Olivia Rodrigo & Louis Partridge’s Relationship From Beginning to Now – Hollywood Life

Musicians Union Supports Springsteen After Trump’s “Personal Attacks”

David E. Kelley Writing ‘Bonfire of the Vanities’ TV Series for Apple

First Look at Jamie Bell as Duke Shelby

Mercor, a $10 billion AI startup, confirms it was the victim of a major cybersecurity breach

US agencies to monitor drinking water for microplastics, pharmaceuticals

Mercor, a $10 billion AI startup, confirms it was the victim of a major cybersecurity breach

OpenAI acquires technology talk show TBPN in surprise move

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays

OpenAI’s new AI Reinforcement Fine-Tuning could transform how scientists use its models

Why hands-on digital skills will define the value of AI

Olivia Rodrigo & Louis Partridge’s Relationship From Beginning to Now – Hollywood Life

Mercor, a $10 billion AI startup, confirms it was the victim of a major cybersecurity breach

‘I was stuck on the overpass with dump trucks all around me’: A mass Baidu robotaxi outage just caused traffic mayhem in China