Huging face researchers are trying to build a more open version of the “reasoning” model of Deepseek

Huging face researchers are trying to build a more open version of the “reasoning” model of Deepseek


Barely a week after Deepseek has released his model of “R1’s” R1 – which sent the markets in a plate – Hugging face researchers are trying to replicate the model from scratch in what they call a search for “knowledge open “.

Embrace the head of research Leandro Von Werra and several engineers of the company have launched Open-R1, a project that tries to build a duplicate of R1 and Open Source all its components, including the data used to train it.

The engineers declared that they were forced to act by the philosophy of release of “Black Box” of Deepseek. Technically, R1 is “open” as the model is permanently authorized, which means that it can be distributed largely without restrictions. However, R1 is not “open source” for the definition widely accepted because some of the tools used to build it are wrapped in mystery. Like many high -altitude artificial intelligence companies, Deepseek is detestable to reveal its secret sauce.

“The R1 model is impressive, but there are no open data sets, experiment details or intermediate models, which makes replication and further research difficult,” he told Techcrunch Elie Bakooch, one of the face engineers embraced on the project Open-R1. “The complete architecture of R1 completely open does not only concern transparency: it is a matter of unlocking its potential.”

Not so open

Deepseek, a Chinese artificial intelligence laboratory financed in part by a quantitative hedge fund, published R1 last week. On a number of reference parameters, the correspondences R1 – and even exceed – the performance of the Openi’s reasoning model.

Being a reasoning model, R1 occurs effectively in itself, which helps it to avoid some of the pitfalls that normally stumble on the models. The reasoning models require a little more time-on usual a few or minutes longer-to get to solutions than a typical non-reduced model. The positive side is that they tend to be more reliable in sectors such as physics, science and mathematics.

R1 broke into the traditional conscience after the DeePseek chatbot app, which provides free access to R1, has risen to the top of Apple’s apps cards. The speed and efficiency with which R1 was developed – Deepseek released the model a few weeks after the release of Openai O1 – has brought many analysts and technologists of Wall Street to wonder if the United States can maintain its advantage in the race AI .

The Open-R1 project is less worried about the US domain compared to “completely open the black model formation box,” Bakouch said to Techcrunch. He observed that, since R1 has not been released with the training code or training instructions, it is difficult to study the model in depth – much less to guide his behavior.

“Having control over the data set and on the process is essential for the distribution of a model responsible in sensitive areas,” said Bakouch. “It also helps to understand and face prejudices in the model. The researchers require more than fragments (…) to push the boundaries of what is possible. “

Steps for the reply

The goal of the Open-R1 project is to replicate R1 in a few weeks, based in part on the embrace Science Cluster, a dedicated search server with 768 GPU Nvidia H100.

The face engineers embrace to touch the scientific cluster to generate sets of data similar to those of Deepseek used to create R1. To build a training pipeline, the team is soliciting the help of the AI ​​and larger technological communities to embrace Face and Github, where the Open-R1 project is hosted.

“We have to make sure to implement the algorithms and the recipes (correctly),” said Von Werra to Techcrunch, “but it is something that a community effort is perfect in facing, where you get as much as possible eyes on the problem.”

There is already a lot of interest. The Open-R1 project collected 10,000 stars in just three days on Github. The stars are a way for Github users to indicate that he likes a project or find it useful.

If the Open-R1 project is successful, artificial intelligence researchers will be able to rely on the training pipeline and work on the development of the next generation of Open Source reasoning models, said Bakouch. He hopes that the Open-R1 project not only produces a strong R1 open source replica, but a basis for the best models to come.

“Rather than being a zero -sum game, Open Source development immediately benefits everyone, including Frontier Labs and models providers, as they can all use the same innovations,” said Bakooch.

While some artificial intelligence experts have raised concerns about the potential for open source artificial intelligence, Bakouch believes that the benefits exceed the risks.

“When the R1 recipe has been replicated, anyone can rent a few GPUs can build their variant of R1 with their own data, further spreading technology everywhere,” he said. “We are really enthusiastic about the recent Open Source releases that are strengthening the role of opening in the AI. It is an important movement for the field that changes the narrative that only a handful of laboratories is able to make progress and that the open source is late. “

Comments

No comments yet. Why don’t you start the discussion?

Leave a Reply

Your email address will not be published. Required fields are marked *