Krutrim Now Hosts DeepSeek Models On Its Cloud

SUMMARY

Krutrim is now hosting open source AI models of Chinese GenAI company DeepSeek on its cloud platform

Aggarwal further said that the details on its “AI lab, state of the art (SOTA) model and research progress, open source drops on 4th Feb"

Five models of DeepSeek are available on the platform, ranging from 8 Bn tokens to 70 Bn trained models

Bhavish Aggarwal’s AI unicorn Krutrim is now hosting open source AI models of Chinese GenAI company DeepSeek on its cloud platform. 

“Krutrim has accelerated efforts to develop world class AI. As a first step, our cloud now has DeepSeek models live, hosted on Indian servers. Pricing lowest in the world,” Aggarwal said in a post on social media platform X.  

Further hinting on the open source efforts, Aggarwal said that the details on its “AI lab, state of the art (SOTA) model and research progress, open source drops on 4th Feb!”

As of now, five models of DeepSeek are available on the platform, ranging from 8 Bn tokens to 70 Bn trained models. 

Apart from the main model DeepSeek R1, which came into the news for beating OpenAI’s GPT-4o in many benchmarks, Krutrim also hosts distilled versions like DeepSeek-R1-Distill-Llama-8B, DeepSeek-R1-Distill-Qwen-14B, DeepSeek-R1-Distill-Qwen-32B,  DeepSeek-R1-Distill-Llama-70B. 

Overall, DeepSeek has launched six distilled models based on Alibaba and Meta’s open source models Qwen and Llama, respectively. 

Model distillation is a process of transferring knowledge from a large model to a smaller model. DeepSeek says that R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. 

Notably, the R1-Distill-Llama-8B model on Krutrim’s cloud only costs about INR 10 per 1 Mn tokens generated. The costliest DeepSeek model costs about INR 60 per 1 Mn tokens. 

For context, tokens are a fundamental unit of text that LLMs generate. 

Founded in April 2023, Krutrim offers a foundational generative AI model, GPU-as-a-service, model-as-a-service, along with different no code platforms. It became a unicorn in January last year after raising $50 Mn in a round led by a clutch of investors including Matrix Partners India.

The update comes as the recent launch of DeepSeek’s R1 model has shaken the tech industry worldwide, with its model costing about 95% cheaper than OpenAI’s GPT-4 model. Further, while GPT-4 training cost was about $100 Mn, DeepSeek claims that its model cost only under $6 Mn. 

After DeepSeek’s launch, Nvidia’s stock tumbled– along with most of the US tech stocks. 

Meanwhile, IT minister Ashwini Vaishnaw recently said that India is also planning to build a domestic LLM as part of the INR 10,037 Cr IndiaAI Mission. Along with this, he also said 

that India will also host DeepSeek on local servers to address the privacy concerns regarding the cross-border data transfer. 

On the same day, IndiaAI Mission sought proposals for Indian foundational models, which will be shortlisted on factors like innovativeness of the approach, scalability and sustainability, financial viability, ethical considerations, among others.

Under the plan, the Mission will offer “milestone-based disbursements” in the form of a direct grant and compute credits for AI Compute.

You have reached your limit of free stories
Become A Startup Insider With Inc42 Plus

Join our exclusive community of 10,000+ founders, investors & operators and stay ahead in india's startup & business economy.

2 YEAR PLAN
₹19999
₹7999
₹333/Month
UNLOCK 60% OFF
Cancel Anytime
1 YEAR PLAN
₹9999
₹4999
₹416/Month
UNLOCK 50% OFF
Cancel Anytime
Already A Member?
Discover Startups & Business Models

Unleash your potential by exploring unlimited articles, trackers, and playbooks. Identify the hottest startup deals, supercharge your innovation projects, and stay updated with expert curation.

Krutrim Now Hosts DeepSeek Models On Its Cloud-Inc42 Media
How-To’s on Starting & Scaling Up

Empower yourself with comprehensive playbooks, expert analysis, and invaluable insights. Learn to validate ideas, acquire customers, secure funding, and navigate the journey to startup success.

Krutrim Now Hosts DeepSeek Models On Its Cloud-Inc42 Media
Identify Trends & New Markets

Access 75+ in-depth reports on frontier industries. Gain exclusive market intelligence, understand market landscapes, and decode emerging trends to make informed decisions.

Krutrim Now Hosts DeepSeek Models On Its Cloud-Inc42 Media
Track & Decode the Investment Landscape

Stay ahead with startup and funding trackers. Analyse investment strategies, profile successful investors, and keep track of upcoming funds, accelerators, and more.

Krutrim Now Hosts DeepSeek Models On Its Cloud-Inc42 Media
Krutrim Now Hosts DeepSeek Models On Its Cloud-Inc42 Media
You’re in Good company