Transparent, Scalable Pricing for your AI Workloads
From development to deployment, our pricing adapts to your needs

- AI Workspace: Tailored for your AI projects, with flexible instances.
- AI Lab: Designed for your educational needs; purchase as per daily lab requirements.
- Bare Metal: Raw power at a predictable cost, optimized for your high-performance AI workloads.
- Shakti Clusters: Scale seamlessly with transparent pricing for powerful, AI-optimized clusters.
- AI Endpoints: Deploy AI models effortlessly with cost-effective, per-usage pricing.
- Serverless: Maximize efficiency with no upfront costs; only pay for what your AI workloads consume.
- Add-On Services
AI Workspace Pricing
Shakti Cloud AI Workspace delivers a seamless experience with flexible configurations, offering virtual machines to meet diverse AI workload demands.

H100 configuration:
- GPU: 1 x NVIDIA H100 (80 GB)
- vCPU: 24
- RAM: 240 GB
- Block Storage: 2 TB
- Unlimited data ingress & egress

L40S configuration:
- GPU: 1 x NVIDIA L40S (48 GB)
- vCPU: 24
- RAM: 240 GB
- Block Storage: 2 TB
- Unlimited data ingress & egress
- Application Load Balancer (Layer 7): Distributes incoming traffic across targets such as web traffic (HTTP/HTTPS). (Per Gateway per GB)
- Platform for deploying and managing web apps with infrastructure abstraction. (Per Instance)
- Container Apps: Platform to run containers without managing infrastructure. (Per vCPU per GB Memory)
- DDoS Protection: Always-on protection against DDoS attacks, up to the subscribed clean traffic. (Per Mbps)
- DNS: DNS service, per 1 million DNS queries. (Per Unit)
- MySQL Community: Open-source relational MySQL database management service with continuous updates and patches, with community support to handle workloads efficiently. (Includes license plus support cost only) (Per Instance)
- MySQL Enterprise: Open-source relational MySQL database management service with continuous updates and patches, with enterprise-grade support to handle mission-critical workloads efficiently. (1 license covers up to 16 vCPUs; includes license plus support cost only) (Per License)
- PostgreSQL Community: Open-source object-relational database management service with continuous updates and patches, with community support. Ideal for small and medium-scale transactional and analytical workloads, ensuring fast data processing with minimal overheads. (Includes license plus support cost only) (Per Instance)
- PostgreSQL Enterprise: Open-source object-relational database management service with continuous updates and patches, with enterprise-grade support. Ideal for large and medium-scale transactional and analytical workloads, ensuring fast data processing with minimal overheads. (Per-core license cost; includes license plus support cost only) (Per License)
- Kubernetes Service: Managed Kubernetes service (container infrastructure needs to be subscribed additionally). (Per Unit)
- Comprehensive monitoring platform for resources, infrastructure, etc. (Data Ingested per GB; per API call)
- Backup and Recovery: Backup as a service. (Per GB)
- Bandwidth: Internet bandwidth. (Per Mbps)
- Content Delivery Network: Globally distributed network for fast, secure content delivery, reducing latency and load times for websites, apps, and media. (Data Transfer per GB; Requests)
- Container Registry: Managed registry service for storing container images used by resources. (Per Storage GB)
- Log Analytics: Platform providing analytics and insights into log and telemetry data collected from resources. (Data Ingested per GB)
- Key Management System as a Service (KMS), for 10 keys. (Per Unit)
- Network Load Balancer (Layer 4). (Per Mbps)
- Collects, analyzes, and visualizes logs and telemetry information. (Per GB)
- NAT Gateway. (Per Unit)
- Redis Enterprise: Enterprise-grade Redis service with enterprise support, making it perfect for mission-critical distributed data systems. (Per License Unit)
- Microsoft SQL Standard: Microsoft SQL Standard Edition, along with support. Best for standard SQL workloads with moderate complexity and data requirements. (Per license, per 2 cores; includes license plus support cost only) (Per License Unit)
- Microsoft SQL Enterprise: Enterprise-grade Microsoft SQL with enterprise support and advanced features for analytics, business intelligence, and large-scale transactional systems. (Per license, per 2 cores; includes license plus support cost only) (Per License Unit)
- Object Storage: Low-cost object storage, 100 GB. (Per Unit)
- Windows: 2 vCPU, 4 GB RAM, 100 GB storage, vNIC, up to 10G network performance. (Per Instance)
- Linux (Ubuntu): 2 vCPU, 4 GB RAM, 100 GB storage, vNIC, up to 10G network performance. (Per Instance)
- Virtual Network (VNet / VPC). (Per Unit)
- Cloud-based solution for automating workflows, infrastructure management, and repetitive tasks across environments. (Per Job Execution)
- Managed service that enables secure RDP/SSH access to VMs via a web browser, eliminating public IP exposure. (Per Instance)
- Data Factory: Fully managed ETL (Extract, Transform, Load) service for integrating data from multiple sources into cloud storage or databases. (Pipeline Activity Run; Data Movement per GB)
- Scalable data engineering and analytics service built for real-time processing and collaboration. (Per DBU - Databricks Unit)
- Global application delivery network providing traffic acceleration, security, and load balancing for web applications. (Outbound Data Transfer per GB)
- A set of pre-built AI models that enable applications to perform intelligent tasks without deep AI expertise. (Per Transactions; API Calls)
- Serverless container service that allows businesses to deploy applications quickly without provisioning or maintaining VMs. (Per vCPU per GB Memory)
- Scalable email delivery service for transactional and marketing emails with built-in security and compliance features. (Per Mail)
- Managed event ingestion service designed to handle millions of events per second for real-time data streaming. (Per Unit)
- Cloud-based workflow automation service that connects multiple applications and services for streamlined business processes. (Per Task)
- Security posture management and threat protection service that helps detect vulnerabilities and attacks in cloud environments. (Per Endpoint)
- Kafka: Reliable Kafka cloud messaging service for decoupling applications and enabling asynchronous communication. (Per Instance)
AI Lab Pricing
Select the AI Lab that fits your budget and training needs — opt for unlimited hours or flexible time slots
Instant AI Lab, zero hassle. Just subscribe to get GPU access and the AI Lab platform—with unlimited user onboarding. Bring your own storage or add it separately. Comes with pre-configured ML/DL containers, so your students can start learning from day one.
| Price per workstation | Description | GPU Memory | Monthly price |
|---|---|---|---|
| AI Lab Workstation - | | 8 GB L40S | ₹ 30,000 |
| AI Lab Workstation - | | 16 GB L40S | ₹ 45,000 |
| AI Lab Workstation - | | 40 GB H100 | ₹ 90,000 |
| AI Lab Workstation - | | 80 GB H100 | ₹ 1,40,000 |
| AI Lab Workstation - | | 160 GB H100 | ₹ 2,80,000 |
| AI Lab Workstation - | | 320 GB H100 | ₹ 5,60,000 |
Total control, ultimate flexibility. Choose your GPUs, scale users as needed and run the AI Lab platform your way. Create multiple GPU profiles—let students use GPUs on weekdays and researchers take over on weekends. Includes storage options and pre-configured ML/DL containers for a seamless setup.
| Workstation | Description | Price per Month |
|---|---|---|
| GPU | Select the GPU capacity that fits your AI Lab needs | |
| AI Lab | | ₹ 70,000 |
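One way to compare the dedicated workstation tiers in the table above is monthly price per GB of GPU memory. The sketch below computes that from the listed figures (the tier data is copied directly from the table; this normalization is our own, not part of the official pricing):

```python
# Monthly price per GB of GPU memory for each AI Lab workstation tier,
# computed from the pricing table above (₹ per month divided by GPU GB).

tiers = [  # (GPU memory in GB, GPU model, monthly price in ₹)
    (8, "L40S", 30_000),
    (16, "L40S", 45_000),
    (40, "H100", 90_000),
    (80, "H100", 1_40_000),
    (160, "H100", 2_80_000),
    (320, "H100", 5_60_000),
]

for mem_gb, gpu, price in tiers:
    # Price per GB of GPU memory: a rough measure of value per tier.
    print(f"{mem_gb:>3} GB {gpu}: ₹ {price / mem_gb:,.0f} per GB per month")
```

By this measure, the 80 GB and larger H100 tiers work out to ₹ 1,750 per GB per month, noticeably below the 8 GB L40S tier at ₹ 3,750 per GB per month.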
Bare Metal Pricing
Our budget-friendly plans are designed to provide the best value, tailored to meet your training needs
| Plan | Monthly contract | 6 months contract | 12 months contract | 24 months contract | 36 months contract | 48 months contract |
|---|---|---|---|---|---|---|
| Bare Metal 8 x H100 HGX | 325 | 313 | 243 | 234 | 226 | 217 |
| Bare Metal 4 x L40S | 182 | 161 | 147 | 145 | 143 | 141 |
Shakti GPU Clusters Pricing
Our budget-friendly plans are designed to provide the best value, tailored to meet your training needs
| Plan | Monthly contract | 6 months contract | 12 months contract | 24 months contract | 36 months contract | 48 months contract |
|---|---|---|---|---|---|---|
| SHAKTI CLOUD - H100 HGX Cluster | 357 | 345 | 267 | 257 | 249 | 239 |
| SHAKTI CLOUD - L40S Cluster | 200 | 177 | 162 | 160 | 158 | 156 |
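The longer commitments in the bare metal and cluster tables above trade flexibility for a lower rate. A small sketch of the arithmetic, using the H100 HGX cluster row (the source does not state the billing unit for these figures, so only the relative savings are meaningful):

```python
# Percentage saved by each commitment term versus the monthly rate,
# using the SHAKTI CLOUD - H100 HGX Cluster row from the table above.
# The billing unit is not stated in the source, so we compare ratios only.

h100_hgx_cluster = {
    "monthly": 357, "6 months": 345, "12 months": 267,
    "24 months": 257, "36 months": 249, "48 months": 239,
}

baseline = h100_hgx_cluster["monthly"]
for term, rate in h100_hgx_cluster.items():
    saving = (baseline - rate) / baseline * 100
    print(f"{term:>9}: rate {rate}, saves {saving:.1f}% vs monthly")
```

The 12-month commitment is where the discount steepens: roughly 25% off the monthly rate, versus about 33% for the full 48-month term.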
AI Endpoints Pricing
Select the right endpoint for your project and enjoy transparent, token-based pricing designed to meet your unique AI needs.
| Model Name | Description | Use cases | Pricing |
|---|---|---|---|
| Meta/Llama3-8b-instruct | NVIDIA NIM for GPU-accelerated Llama 3 8B inference through OpenAI-compatible APIs | | 20 / 1M Tokens |
| Llama-3.1-8b-instruct | NVIDIA NIM for GPU-accelerated Llama 3.1 8B inference through OpenAI-compatible APIs | | 20 / 1M Tokens |
| Meta/Llama3-70b-instruct | NVIDIA NIM for GPU-accelerated Llama 3 70B inference through OpenAI-compatible APIs | | 140 / 1M Tokens |
| Llama-3.1-405b-instruct | NVIDIA NIM for GPU-accelerated Llama 3.1 405B inference through OpenAI-compatible APIs | | 705 / 1M Tokens |
| Llama-3.1-70b-instruct | NVIDIA NIM for GPU-accelerated Llama 3.1 70B inference through OpenAI-compatible APIs | | 140 / 1M Tokens |
| Mixtral-8x7B-Instruct-v0.1 | NVIDIA NIM for GPU-accelerated Mixtral-8x7B-Instruct-v0.1 inference through OpenAI-compatible APIs | | 90 / 1M Tokens |
| Llama-3.1-8b-base | NVIDIA NIM for GPU-accelerated Llama 3.1 8B inference through OpenAI-compatible APIs | | 20 / 1M Tokens |
| Mixtral-8x22B-Instruct-v0.1 | NVIDIA NIM for GPU-accelerated Mixtral-8x22B-Instruct-v0.1 inference through OpenAI-compatible APIs | | 140 / 1M Tokens |
| MolMIM | MolMIM is a transformer-based model developed by NVIDIA for controlled small molecule generation. | | 350 / Request |
| meta-llama-2-13b-chat | NVIDIA NIM for GPU-accelerated Llama 2 13B inference through OpenAI-compatible APIs | | 65 / 1M Tokens |
| meta-llama-2-70b-chat | NVIDIA NIM for GPU-accelerated Llama 2 70B inference through OpenAI-compatible APIs | | 140 / 1M Tokens |
| DiffDock | DiffDock predicts the 3D structure of the interaction between a molecule and a protein. | | 440 / Request |
| Llama-3-Taiwan-70B-Instruct | NVIDIA NIM for GPU-accelerated Llama-3-Taiwan-70B-Instruct inference through OpenAI-compatible APIs | | 140 / 1M Tokens |
| ASR Parakeet CTC Riva 1.1b | Riva ASR NIM delivers accurate English speech-to-text transcription and enables easy-to-use, optimized ASR inference for large-scale deployments. | | 175 / Request |
| ProteinMPNN | Predicts amino acid sequences from the 3D structure of proteins. | | 350 / Request |
| nemotron-4-340b-instruct | NVIDIA NIM for GPU-accelerated Nemotron-4-340B-Instruct inference through OpenAI-compatible APIs | | 705 / 1M Tokens |
| AlphaFold2 | A widely used model for predicting the 3D structures of proteins from their amino acid sequences. | | 175 / Request |
| TTS FastPitch HifiGAN Riva | Riva TTS NIM provides easy access to state-of-the-art text-to-speech models, capable of synthesizing English speech from text. | | 90 / Request |
| NMT Megatron Riva 1b | Riva NMT NIM provides easy access to state-of-the-art neural machine translation (NMT) models, capable of translating text from one language to another with exceptional accuracy. | | 45 / 1M Tokens |
| meta-llama-2-7b-chat | NVIDIA NIM for GPU-accelerated Llama 2 7B inference through OpenAI-compatible APIs | | 20 / 1M Tokens |
| Mistral-7B-Instruct-v0.3 | NVIDIA NIM for GPU-accelerated Mistral-7B-Instruct-v0.3 inference through OpenAI-compatible APIs | | 20 / 1M Tokens |
| Nemotron-4-340B-Reward | NVIDIA NIM for GPU-accelerated Nemotron-4-340B-Reward inference through OpenAI-compatible APIs | | 705 / 1M Tokens |
| NVIDIA Retrieval QA E5 Embedding v5 | NVIDIA NIM for GPU-accelerated NVIDIA Retrieval QA E5 Embedding v5 inference | | 175 / 1M Tokens |
| NVIDIA Retrieval QA Mistral 4B Reranking v3 | NVIDIA NIM for GPU-accelerated NVIDIA Retrieval QA Mistral 4B Reranking v3 inference | | 45 / 1M Tokens |
| NVIDIA Retrieval QA Mistral 7B Embedding v2 | NVIDIA NIM for GPU-accelerated NVIDIA Retrieval QA Mistral 7B Embedding v2 inference | | 65 / 1M Tokens |
| Llama-3-Swallow-70B-Instruct-v0.1 | NVIDIA NIM for GPU-accelerated Llama-3-Swallow-70B-Instruct-v0.1 inference through OpenAI-compatible APIs | | 140 / 1M Tokens |
| Snowflake Arctic Embed Large Embedding | NVIDIA NIM for GPU-accelerated Snowflake Arctic Embed Large Embedding inference | | 45 / 1M Tokens |
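To see how token-based billing adds up, the sketch below estimates a month of usage from the per-million-token figures in the table above. It assumes those figures are ₹ per 1M tokens and that input and output tokens are billed at the same rate; the usage numbers are made up for illustration.

```python
# Estimate monthly spend under per-million-token pricing.
# Assumes the table's figures are ₹ per 1M tokens and that input and
# output tokens are billed at the same rate (verify against your invoice).

RATES_INR_PER_1M = {  # copied from a few rows of the pricing table
    "Meta/Llama3-8b-instruct": 20,
    "Meta/Llama3-70b-instruct": 140,
    "Llama-3.1-405b-instruct": 705,
}

def monthly_cost(model: str, tokens_per_request: int,
                 requests_per_day: int, days: int = 30) -> float:
    """Return estimated cost in ₹ for a month of usage."""
    total_tokens = tokens_per_request * requests_per_day * days
    return total_tokens / 1_000_000 * RATES_INR_PER_1M[model]

# Example: 2,000 tokens per request, 5,000 requests per day on Llama 3 8B
# → 300M tokens per month → 300 x 20 = ₹ 6,000.
print(f"₹ {monthly_cost('Meta/Llama3-8b-instruct', 2_000, 5_000):,.0f}")
```

The same workload on the 405B model would cost about 35x more, which is why matching model size to the task matters as much as the per-token rate.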
Serverless Pricing
Leverage on-demand compute power with pay-as-you-go pricing, perfect for scaling AI applications seamlessly.
- GPU: 80 GB H100; CPU: 14 cores; RAM: 256 GB; Storage: 500 GB; unlimited ingress and egress
- GPU: 40 GB H100; CPU: 7 cores; RAM: 128 GB; Storage: 250 GB; unlimited ingress and egress
- GPU: 48 GB L40S; CPU: 16 cores; RAM: 256 GB; Storage: 500 GB; unlimited ingress and egress
- GPU: 16 GB L40S; CPU: 4 cores; RAM: 64 GB; Storage: 128 GB; unlimited ingress and egress
- GPU: 24 GB L40S; CPU: 8 cores; RAM: 128 GB; Storage: 258 GB; unlimited ingress and egress
Add-On Services

| Service | Description | Price |
|---|---|---|
| High Speed Storage | Storage for faster reads and writes of data, local and remote | ₹ 5.80 per GB |
| Object Storage | Buckets to read and transmit data, local and remote | ₹ 2.50 per GB |
| Public IP | Public IP for connectivity | ₹ 528 per IP |
| MRC | | ₹ 660 |
Accelerate AI with
Shakti Cloud
Get Started
