Deepseek: everything you need to know about the chatbot app

Deepseek: everything you need to know about the chatbot app


Deepseek has become viral.

Chinese lab at Lab Deepseek entered the traditional conscience this week after its chatbot app rose to the top of the Apple Apple stores (and also Google Play). The models AI of Deepseek, which were trained using calculation efficiency techniques, have guided the analysts of Wall Street-and the technologists-to wonder if the United States can maintain its advantage in the AI ​​race and if the demand for chips to the support.

But where does Deepsek come from and how did he go up to international fame so quickly?

Origins of the Trader of Deepseek

Deepseek is supported by High-Flyer Capital Management, a Chinese quantitative Hedge Fund that uses to inform its trading decisions.

The enthusiasm of the Liang Wenfeng co-founded High-Flyer in 2015. Wenfeng, who according to what reported in the trading while a student at Zhejiang University, launched High-Flyer Capital Management as Hedge Fund in 2019 focused on the development and implementation of artificial intelligence algorithms.

In 2023, High-Flyer began Deepseek as a laboratory dedicated to the search for artificial intelligence tools separated from its financial activity. With High-Flyer as one of its investors, the laboratory has turned into its own company, she also called Deepseek.

From the first day, Deepseek has built its data center clusters for the formation of the model. But like other artificial intelligence companies in China, Deepseek has been affected by the American export bans on hardware. To form one of its most recent models, the company has been forced to use the NVIDIA H800 chips, a less powerful version of a chip, the H100, available for US companies.

Techcrunch event

Berkeley, ca.
|
June 5th

Book now

Deepseek technical team is said to distort young people. According to reports, the company recruits aggressively doctoral researchers of the best Chinese universities. Deepseek also takes people without any computer background to help his technology better understand a wide range of topics, according to the New York Times.

The strong models of Deepseek

Deepseek revealed his first series of programmer-deepseek models, deepseek llm and deepseek chat-in November 2023. But it was not until last spring, when the start published his family of next generation Deepseek-V2 models, which the artificial intelligence industry has started to take note of it.

Deepseek-V2, a text and image analysis system for the general purpose, has worked well in various artificial intelligence benchmark-and was much cheaper to perform than the comparable models at the moment. He forced the national competition of Deepseek, including Bytedance and Alibaba, to cut the use prices for some of their models and make others completely free.

Deepseek-V3, launched in December 2024, was added only to Deepseek’s reputation.

According to the internal reference tests of Deepseek, Deepseek V3 surpasses both downloadable models, available openly such as the Meta Lama and the “closed” models that can only be accessed through an API, such as Openi GPT-4O.

Equally impressive is the model of “reasoning” of Deepseek. Released in January, Deepseek says that R1 performs and the O1 model of Openai on the key parameters.

Being a reasoning model, R1 occurs effectively itself, which helps it to avoid some of the pitfalls that normally stumble on the models. The reasoning models require a little more time-on usual a few or minutes longer-to get to solutions than a typical non-reduced model. The positive side is that they tend to be more reliable in sectors such as physics, science and mathematics.

There is a negative aspect for R1, Deepseek V3 and the other Deepseek models. Being developed in Chinese, they are subject to benchmarking by the Chinese regulator of the Internet to ensure that its responses “embodies the fundamental socialist values”. In the DeePseek chatbot app, for example, R1 will not answer questions about Tiananmen Square or Taiwan’s autonomy.

In March, Deepek exceeded 16.5 million visits. “(F) or March, Deepseek is in second place, despite having seen the 25% traffic fall from where he was in February, according to daily visits,” David Carr, editor of Faroweb told Techcrunch. Lime still compared to Chatgpt, which exceeded over 500 million active weekly users in March.

A disruptive approach

If Deepseek has a business model, it is not clear what that model is exactly. The company prices its products and services well below the market value and offers others for free. In addition, it is not taking money on investors, despite a ton of VC interest.

The way Deepseek says so, the efficiency discoveries have allowed him to maintain the competitiveness of the extreme cost. However, some experts contest the figures that the company has provided.

Whatever the case, the developers have taken the Deepseek models, which are not open source as the phrase is commonly understood but are available in permissive licenses that allow commercial use. According to Clem Delague, the CEO of Hugging Face, one of the platforms that houses the Deepseek models, the developers on Hugging Face have created over 500 “derivatives” models of R1 which have collected 2.5 million combined downloads.

Deepseek’s success against the biggest and most established rivals has been described as “to overturning” and “too hypothesized”. The success of the company was at least partially responsible for the fact that the price of the Nvidia shares decreases by 18% in January and for having aroused a public response from the CEO of Openi Sam Altman. In March, the offices of the United States Department of Commerce reported to the staff that Deepseek will be prohibited on their government devices, according to Reuters.

Microsoft has announced that Deepseek is available on his Azure Ai Foundry service, the Microsoft platform that brings together artificial intelligence services for companies under a single banner. As a question about Deepseek’s impact on the artificial intelligence expenditure of the destination during his call on the profits of the first quarter, CEO Mark Zuckerberg said that the expenditure for the artificial intelligence infrastructure will continue to be a “strategic advantage” by goal. In March, Openi defined Deepseek “state subsidized by the state” and “controlled by the State” and recommends the United States government to consider to prohibit models from Deepseek.

During the call of profits of the fourth quarter of Nvidia, CEO Jensen Huang underlined “the excellent innovation” of Deepseek, saying that the “reasoning” models and other “reasoning” models are excellent for Nvidia because they need much more calculation.

At the same time, some companies are prohibiting Deepseek, as well as entire countries and governments, including South Korea. New York State has also banned Deepseek to be used on government devices.

In May, Microsoft’s vice -president and President Brad Smith said in an audition of the Senate that Microsoft employees are not authorized to use Deepseek due to data security and propaganda concerns.

As for what could contain the future of Deepseek, it is not clear. The improved models are a data. But the United States government seems to be wary of what it perceives as a harmful foreign influence. In March, the Wall Street Journal reported that the United States will probably forbid Deepseek on government devices.

This story was originally published on January 28, 2025 and will be updated regularly.

Comments

No comments yet. Why don’t you start the discussion?

Leave a Reply

Your email address will not be published. Required fields are marked *