Leveraging Fractional GPUs on Kubernetes

March 28, 2024

Why Fractional GPUs?

Fractional GPUs enable us to allocate multiple workloads to a single GPU, which can be useful in the following scenarios:

  1. Each workload takes around 2-3 GB of VRAM, so you can place multiple replicas of the workload on a single GPU that has 16 GB of VRAM or more.
  2. Each workload receives sparse traffic and is not able to max out the GPU on its own.

How to use Fractional GPUs?

  1. TimeSlicing: In this approach, you slice a GPU into a fixed number of fractional parts and then request a fraction of the GPU for each workload. For example, we can divide a GPU into 10 slices and then request 3 slices for workload1, 5 slices for workload2, and 2 slices for workload3. This means that workload1 will use 0.3 GPU (compute + 30% of VRAM), workload2 will use 0.5 GPU, and workload3 will use 0.2 GPU. However, timeslicing is only used for scheduling the workloads onto the same machine - it doesn't provide any actual isolation on the machine. For example, if the GPU has 16GB of VRAM, it's on the user to make sure that workload1 takes less than 4.8GB of VRAM, workload2 takes less than 8GB, and workload3 takes less than 3.2GB. If one workload suddenly starts taking more memory, it can crash the other processes. The compute is also shared, but one workload can go up to using the complete GPU if the other workloads are idle - the GPU is basically context-switching among the three workloads. You can read more about time-slicing in the NVIDIA k8s-device-plugin documentation.
  2. MIG (Multi-Instance GPUs): This is a feature provided by Nvidia on the A100 and H100 GPUs - it is not available on most other GPUs. We can divide a GPU into a fixed number of configurable parts chosen from a set of discrete profiles (for an A100 40GB these are 1g.5gb, 2g.10gb, 3g.20gb, 4g.20gb, and 7g.40gb). Each workload chooses one of the slices and gets both compute and memory isolation. The instances are not exact fractions of the GPU but discrete units defined by the profile. For example, if we divide one A100 40GB GPU into 7 parts, we can place 7 workloads, each using around 1/7 of the compute and 5GB of VRAM. Please note that we cannot simply give 2 slices to one workload in this case and expect it to get 2/7 of the GPU and 10GB of VRAM - each workload gets exactly one slice. A minimal sketch of how these fractions show up as Kubernetes resources follows this list.
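
In Kubernetes terms, both approaches surface as extended resources that a pod requests. Below is a minimal sketch of the resource requests, assuming the NVIDIA device plugin is configured to expose MIG slices under the "mixed" strategy (as nvidia.com/mig-<profile>) and to rename time-sliced GPUs (as nvidia.com/gpu.shared) - the exact resource names depend on how the device plugin is configured, which is covered later in this post.

# Minimal sketch of the resource requests - exact names depend on the device-plugin config.
# MIG slice (mixed strategy): isolated compute and memory
resources:
  limits:
    nvidia.com/mig-1g.5gb: 1
# Time-sliced share (with renameByDefault: true): scheduling only, no isolation
# resources:
#   limits:
#     nvidia.com/gpu.shared: 1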

Prerequisites for Fractional GPU

Add Cloud Integration

To enable fractional GPUs, we need to create a separate nodepool for the GPUs - it will not work via standard dynamic node provisioning in AWS / GCP. For Truefoundry to be able to read those nodepools, make sure the cloud integration with Truefoundry is already set up.

If it's not enabled yet, please follow this guide to enable Cloud Integration.

Once cloud integration is added, you need to create nodepools for MIG- or TimeSlicing-enabled GPUs. This configuration differs across cloud providers. Please follow the guides below to enable fractional GPUs on your cluster.

Install Latest Version of tfy-gpu-operator

  • Go to Deployments -> Helm -> tfy-gpu-operator.
  • Click on edit (three dots on the right).
  • Choose the latest version of the chart (top-most) from the dropdown and click Submit.

Enable MIG

Azure

1. Create a nodepool with MIG enabled using the --gpu-instance-profile argument of the Azure CLI. Here is a sample command:

az aks nodepool add \
--cluster-name <your cluster name> \
--resource-group <your resource group> \
--no-wait \
--enable-cluster-autoscaler \
--eviction-policy Delete \
--node-count 0 \
--max-count 20  \
--min-count 1 \
--node-osdisk-size 200 \
--scale-down-mode Delete \
--os-type Linux \
--node-taints "nvidia.com/gpu=Present:NoSchedule" \
--name a100mig7 \
--node-vm-size Standard_NC24ads_A100_v4 \
--priority Spot \
--os-sku Ubuntu \
--gpu-instance-profile MIG1g

2. Refresh the nodepools in the Truefoundry cluster.

3. Deploy your workload by selecting GPU (with count 1) and selecting the correct nodepool. A sketch of the underlying pod settings follows.
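
For reference, here is a minimal sketch of the pod-level settings this translates to on the nodepool created above - a hypothetical example assuming the device plugin uses the MIG "single" strategy, so each slice is advertised as nvidia.com/gpu (with the "mixed" strategy the request would be nvidia.com/mig-1g.5gb instead). Truefoundry generates the equivalent settings when you select the nodepool; the pod name and image are placeholders.

# Minimal sketch (placeholder names), assuming the MIG "single" strategy
apiVersion: v1
kind: Pod
metadata:
  name: mig-workload                           # placeholder
spec:
  nodeSelector:
    kubernetes.azure.com/agentpool: a100mig7   # the nodepool created above
  tolerations:
    - key: nvidia.com/gpu                      # matches the taint set on the nodepool
      operator: Equal
      value: Present
      effect: NoSchedule
  containers:
    - name: server
      image: <your image>                      # placeholder
      resources:
        limits:
          nvidia.com/gpu: 1                    # one MIG slice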

GCP

Create a nodepool and pass the MIG profile in the accelerator flag via gpu-partition-size=1g.5gb (or one of the allowed MIG profile values listed at the top of this page):

gcloud container node-pools create a100-40-mig-1g5gb \
 --project=<enter your project name> \
 --region=<enter your region> \
 --cluster=<enter your cluster name here> \
 --machine-type=a2-highgpu-1g \
 --accelerator type=nvidia-tesla-a100,count=1,gpu-partition-size=1g.5gb \
 --enable-autoscaling \
 --total-min-nodes 0 \
 --total-max-nodes 4 \
 --min-provision-nodes 0 \
 --num-nodes 0

AWS

It is currently not trivial to support MIG GPUs on AWS in a managed way. If you want to try the feature out, please refer to these docs.

Enable Timeslicing

Azure

  1. Ensure that the nvidia-device-plugin config is correctly set in the tfy-gpu-operator chart.
    Go to Helm -> tfy-gpu-operator, click on edit, and ensure the following lines are present in the values (how workloads then request the shared resource is sketched right after the config):

azure-aks-gpu-operator:
  devicePlugin:
    config:
      data:
        all: ""
        time-sliced-10: |-
          version: v1
          sharing:
            timeSlicing:
              renameByDefault: true
              resources:
              - name: nvidia.com/gpu
                replicas: 10
      name: time-slicing-config
      create: true
      default: all
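
With this config, nodes whose nvidia.com/device-plugin.config label points to time-sliced-10 (set on the nodepool in the next step) advertise 10 shares per GPU, while other nodes fall back to the default "all" config. Because renameByDefault is true, a workload on those nodes requests the renamed resource - a minimal sketch, assuming the rename produces nvidia.com/gpu.shared as documented for the NVIDIA device plugin:

# Minimal sketch: each of the 10 shares is advertised as nvidia.com/gpu.shared
resources:
  limits:
    nvidia.com/gpu.shared: 1   # one time-sliced share; no memory isolation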

  2. Create a nodepool with the nvidia.com/device-plugin.config label pointing to the correct time-slicing config using the Azure CLI. Here is a sample command:

az aks nodepool add \
--cluster-name <your cluster name> \
--resource-group <your resource group> \
--no-wait \
--enable-cluster-autoscaler \
--eviction-policy Delete \
--node-count 0 \
--max-count 20  \
--min-count 0 \
--node-osdisk-size 200 \
--scale-down-mode Delete \
--os-type Linux \
--node-taints "nvidia.com/gpu=Present:NoSchedule" \
--name a100mig7 \
--node-vm-size Standard_NC24ads_A100_v4 \
--priority Spot \
--os-sku Ubuntu \
--labels nvidia.com/device-plugin.config=time-sliced-10

  3. Refresh the nodepools on the Truefoundry cluster.
  4. Deploy your workload by selecting GPU (with count 1) and selecting the correct nodepool.

GCP

gcloud container node-pools create a100-40-frac-10 \
       --project=<enter your project name> \
       --region=<enter your region> \
       --cluster=<enter your cluster name here> \
       --machine-type=a2-highgpu-1g \
       --accelerator type=nvidia-tesla-a100,count=1,gpu-sharing-strategy=time-sharing,max-shared-clients-per-gpu=10 \
       --enable-autoscaling \
       --total-min-nodes 0 \
       --total-max-nodes 4 \
       --min-provision-nodes 0 \
       --num-nodes 0

AWS

1. Ensure that the nvidia-device-plugin config is correctly set in the tfy-gpu-operator chart.
Go to Helm -> tfy-gpu-operator, click on edit, and ensure the following lines are present in the values:

aws-eks-gpu-operator:
  devicePlugin:
    config:
      data:
        all: ""
        time-sliced-10: |-
          version: v1
          sharing:
            timeSlicing:
              renameByDefault: true
              resources:
              - name: nvidia.com/gpu
                replicas: 10
      name: time-slicing-config
      create: true
      default: all

2. Create a nodegroup on AWS EKS with the following label (a sample eksctl config is sketched below the label):

labels:
  "nvidia.com/device-plugin.config": "time-sliced-10"

Using fractional GPUs in your Service

To use fractional GPUs in your service:

1. Ensure that you have added the desired nodepools.

2. Please sync the cluster nodepools from your cloud account by going to Integrations -> Clusters -> Sync.

3. You can deploy using either Truefoundry's UI or the Python SDK.
Note: Autoscaling of nodepools works only in GCP clusters. You will need to manually scale nodepools up / down in Azure and AWS.

Deploying with UI

1. To deploy a workload that uses a fractional GPU, start deploying your service/job on Truefoundry and, in the "Resources" section, select the Nodepool Selector.

2. Once you select the Nodepool Selector at the top right of the Resources section, you can see the fractional GPUs on the UI and select one (as shown below).

Using MIG GPU
Using Timeslicing GPU

Deploying with Python SDK

You can use fractional GPUs via the Python SDK with the following changes in the resources field:

1. Using MIG GPUs

from servicefoundry import (
    ...
    Service,
    Resources,
    NvidiaMIGGPU,
    NodepoolSelector,
)

service = Service(
    ...
    resources=Resources(
        ...
        # Pin the workload to the MIG-enabled nodepool
        node=NodepoolSelector(
            nodepools=["<add your nodepool name>"],
        ),
        # Request a single MIG slice with the 1g.5gb profile
        devices=[
            NvidiaMIGGPU(profile="1g.5gb"),
        ],
    ),
)

2. Using Timeslicing GPU

from servicefoundry import (
    Service,
    Resources,
    NvidiaTimeslicingGPU,
    NodepoolSelector,
)

service = Service(
    ...
    resources=Resources(
        ...
        # Pin the workload to the time-slicing-enabled nodepool
        node=NodepoolSelector(
            nodepools=["<add your nodepool name>"],
        ),
        # Request a time-sliced GPU share for this workload
        devices=[
            NvidiaTimeslicingGPU(gpu_memory=4000),
        ],
    ),
)

We at TrueFoundry support Fractional GPUs in an extremely streamlined manner.
