DeepSeek

From GM-RKB

A DeepSeek is a ... that ...



References

2024

  • (Wikipedia, 2024) ⇒ https://en.wikipedia.org/wiki/High-Flyer_(company)#DeepSeek Retrieved:2024-12-27.
    • In April 2023, High-Flyer announced it would form a new research body to explore the essence of artificial general intelligence. However, it would not be used to perform stock trading.[1] This organization would be called DeepSeek.[2]
    • In late 2023, DeepSeek released an open-source LLM named DeepSeek, after the organization.[3]
    • In June 2024, DeepSeek V2 was launched. The Financial Times reported that it was cheaper than its peers, at a price of 2 RMB per million output tokens. The University of Waterloo Tiger Lab’s leaderboard ranked DeepSeek-V2 seventh in its LLM ranking.[4]
    • In November 2024, a preview of DeepSeek R1-Lite was released, which was claimed to have exceeded the performance of OpenAI o1.[5]
    • In December 2024, DeepSeek V3 was launched. It came with 671 billion parameters and was trained in around two months at a cost of US$5.58 million, using significantly fewer resources than its peers. It was trained on a dataset of 14.8 trillion tokens. Benchmark tests showed it outperformed Llama 3.1 and Qwen 2.5 while matching GPT-4o and Claude 3.5 Sonnet.[6][7][8]

  1. "Exclusive: Chinese Quant Hedge Fund High-Flyer Won't Use AGI to Trade Stocks, MD Says" (in en). https://www.yicaiglobal.com/news/exclusive-chinese-quant-fund-high-flyer-will-not-use-agi-to-trade-stocks-managing-director-says. Retrieved 2023-12-31.
  2. Ottinger, Lily (December 9, 2024). "Deepseek: From Hedge Fund to Frontier Model Maker" (in en). https://www.chinatalk.media/p/deepseek-from-hedge-fund-to-frontier. Retrieved 2024-12-27. 
  3. Se, Ksenia (August 28, 2024). "Inside DeepSeek Models" (in en). https://www.turingpost.com/p/deepseek. Retrieved 2024-11-26. 
  4. Financial Times (full citation not provided in the source).
  5. Franzen, Carl (2024-11-20). "DeepSeek’s first reasoning model R1-Lite-Preview turns heads, beating OpenAI o1 performance" (in en-US). https://venturebeat.com/ai/deepseeks-first-reasoning-model-r1-lite-preview-turns-heads-beating-openai-o1-performance/. Retrieved 2024-11-26. 
  6. Jiang, Ben (2024-12-27). "Chinese start-up DeepSeek’s new AI model outperforms Meta, OpenAI products" (in en). https://www.scmp.com/tech/tech-trends/article/3292507/chinese-start-deepseek-launches-ai-model-outperforms-meta-openai-products. Retrieved 2024-12-27. 
  7. Wiggers, Kyle (26 December 2024). "DeepSeek's new AI model appears to be one of the best 'open' challengers yet". https://techcrunch.com/2024/12/26/deepseeks-new-ai-model-appears-to-be-one-of-the-best-open-challengers-yet/. 
  8. Sharma, Shubham (26 December 2024). "DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch". https://venturebeat.com/ai/deepseek-v3-ultra-large-open-source-ai-outperforms-llama-and-qwen-on-launch/.