
    SambaNova hits 198 tokens per second on the full, non-distilled DeepSeek-R1 671B with only 16 SN40L RDU chips



    • SambaNova runs DeepSeek-R1 at 198 tokens/sec using 16 custom chips
    • The SN40L RDU chip is reportedly 3X faster and 5X more efficient than GPUs
    • A 5X speed boost is promised soon, with 100X cloud capacity by year-end

    Chinese AI upstart DeepSeek quickly made a name for itself in 2025 with R1, its large-scale open-source language model built for advanced reasoning tasks, which shows performance on par with the industry’s top models while being more cost-efficient.

    SambaNova Systems, an AI startup founded in 2017 by experts from Sun/Oracle and Stanford University, has now announced what it claims is the world’s fastest deployment of the DeepSeek-R1 671B LLM to date.




    waynewilliams@onmail.com (Wayne Williams)
