OpenAI正發布一款新人工智能模型,內部稱為“草莓”,可以執行一些類似人類的推理任務,寄希望于在競爭激烈的市場中保持領先地位。
該公司周四在一篇博文中說,這款名為“o1”的新模型在回應用戶詢問之前,會花更多時間計算答案。有了這個模型,OpenAI的工具應該能夠解決多步驟問題,包括復雜的數學和編碼問題。
該公司表示:“作為一個早期模型,它還不具備很多使ChatGPT變得有用的功能,比如瀏覽網頁獲取信息、上傳文件和圖片等。但對于復雜推理任務來說,這是一項重大進步,代表了人工智能能力的新水平。鑒于此,我們將計數器重置回1,并將該系列命名為OpenAI o1?!?/p>
付費ChatGPT Plus和團隊用戶將于周四通過OpenAI的熱門聊天機器人訪問該模型的預覽版。彭博社此前報道稱,該公司最快將于本周發布這款新模型。
該模型發布之際,總部位于舊金山的OpenAI正尋求籌集數十億美元的資金,并在開發更復雜的人工智能系統的競賽中面臨著日益激烈的競爭。OpenAI并不是唯一一家致力于開發此類功能的公司;競爭對手Anthropic和谷歌也在其先進的人工智能模型中標榜了“推理”能力。
OpenAI在其博文中舉例說明了該人工智能模型對編碼、英語和數學等主題的問題的回答,并要求它解決一個簡單的填字游戲。OpenAI的研究科學家諾姆·布朗(Noam Brown)在X上發表了一系列文章,表示該公如今發布這個模型的預覽版,部分原因是為了了解人們是如何使用它的,以及它在哪些方面需要改進。
使用OpenAI更新的人工智能系統的體驗將與人們對該公司聊天機器人ChatGPT的期望有所不同。在對用戶的提示做出回應之前,新軟件會暫停幾秒鐘,在用戶看不見的幕后,它會考慮一些相關的提示,然后總結出似乎是最好的答案。這種技術有時被稱為“思維鏈”提示。
一段時間以來,OpenAI一直致力于讓計算機執行多步驟操作。例如,在2023年5月,該公司發布了一篇博文和一篇隨附的研究論文,介紹了其為提高人工智能系統解決數學問題的能力所做的努力。根據這篇論文,該公司訓練一個模型的方法是,獎勵它在得出答案過程中的每一個正確步驟,而不僅僅是獎勵它生成了準確答案。(財富中文網)
譯者:中慧言-王芳
OpenAI正發布一款新人工智能模型,內部稱為“草莓”,可以執行一些類似人類的推理任務,寄希望于在競爭激烈的市場中保持領先地位。
該公司周四在一篇博文中說,這款名為“o1”的新模型在回應用戶詢問之前,會花更多時間計算答案。有了這個模型,OpenAI的工具應該能夠解決多步驟問題,包括復雜的數學和編碼問題。
該公司表示:“作為一個早期模型,它還不具備很多使ChatGPT變得有用的功能,比如瀏覽網頁獲取信息、上傳文件和圖片等。但對于復雜推理任務來說,這是一項重大進步,代表了人工智能能力的新水平。鑒于此,我們將計數器重置回1,并將該系列命名為OpenAI o1。”
付費ChatGPT Plus和團隊用戶將于周四通過OpenAI的熱門聊天機器人訪問該模型的預覽版。彭博社此前報道稱,該公司最快將于本周發布這款新模型。
該模型發布之際,總部位于舊金山的OpenAI正尋求籌集數十億美元的資金,并在開發更復雜的人工智能系統的競賽中面臨著日益激烈的競爭。OpenAI并不是唯一一家致力于開發此類功能的公司;競爭對手Anthropic和谷歌也在其先進的人工智能模型中標榜了“推理”能力。
OpenAI在其博文中舉例說明了該人工智能模型對編碼、英語和數學等主題的問題的回答,并要求它解決一個簡單的填字游戲。OpenAI的研究科學家諾姆·布朗(Noam Brown)在X上發表了一系列文章,表示該公如今發布這個模型的預覽版,部分原因是為了了解人們是如何使用它的,以及它在哪些方面需要改進。
使用OpenAI更新的人工智能系統的體驗將與人們對該公司聊天機器人ChatGPT的期望有所不同。在對用戶的提示做出回應之前,新軟件會暫停幾秒鐘,在用戶看不見的幕后,它會考慮一些相關的提示,然后總結出似乎是最好的答案。這種技術有時被稱為“思維鏈”提示。
一段時間以來,OpenAI一直致力于讓計算機執行多步驟操作。例如,在2023年5月,該公司發布了一篇博文和一篇隨附的研究論文,介紹了其為提高人工智能系統解決數學問題的能力所做的努力。根據這篇論文,該公司訓練一個模型的方法是,獎勵它在得出答案過程中的每一個正確步驟,而不僅僅是獎勵它生成了準確答案。(財富中文網)
譯者:中慧言-王芳
OpenAI is releasing a new artificial intelligence model known internally as “Strawberry” that can perform some human-like reasoning tasks, as it looks to stay at the top of a crowded market of rivals.
The new model, called o1, is designed to spend more time computing the answer before responding to user queries, the company said in a blog post Thursday. With the model, OpenAI’s tools should be able to solve multi-step problems, including complicated math and coding questions.
“As an early model, it doesn’t yet have many of the features that make ChatGPT useful, like browsing the web for information and uploading files and images,” the company said. “But for complex reasoning tasks this is a significant advancement and represents a new level of AI capability. Given this, we are resetting the counter back to 1 and naming this series OpenAI o1.”
A preview version of the model will be available through OpenAI’s popular chatbot, ChatGPT, to paid Plus and Team users on Thursday. Bloomberg previously reported the company could release the new model as soon as this week.
The model’s release comes as San Francisco-based OpenAI is looking to raise billions in funding and faces heightened competition in the race to develop ever more sophisticated artificial intelligence systems. OpenAI isn’t the only company working on such capabilities; competitors Anthropic and Google have also touted “reasoning” skills with their advanced AI models.
In its blog post, OpenAI gave examples of the AI model’s responses to questions on topics including coding, English, and math, and asked it to solve a simple crossword puzzle. In a series of posts on X, Noam Brown, a research scientist at OpenAI, said the company is releasing the model in preview now in part to get a sense for how people use it, and where it needs to be improved.
The experience of using OpenAI’s updated AI system will differ somewhat from what people have come to expect with ChatGPT, the company’s chatbot. Before responding to a user’s prompt, the new software will pause for a matter of seconds while, behind the scenes and invisible to the user, it considers a number of related prompts and then summarizes what appears to be the best response. This technique is sometimes referred to as “chain of thought” prompting.
OpenAI has been working to get computers to carry out multi-step actions for some time. In May 2023, for instance, the company released a blog post and an accompanying research paper about its efforts to improve AI systems’ abilities to solve math problems. According to the paper, the company trained a model by rewarding it for each correct step in the process toward coming up with an answer to a problem, rather than by just rewarding it for generating an accurate answer.