Keynote
- Speakers:
- Laura Schaffer, Head of Growth & Marketing, DigitalOceanPaddy Srinivasan, CEO, DigitalOceanVinay Kumar, CPTO, DigitalOcean
The Conference for the Inference Era
Thank you to our amazing speakers, our generous sponsors, and everyone who joined us at Deploy 2026. Learn about all of our product updates announced at Deploy here.70% of AI spend is now inference. See how Character.AI partnered with Inferact and DigitalOcean to cut inference costs by 50%, while improving throughput, on AMD GPUs.
Scaling inference isn't a model problem. It's a decisions problem. Industry leaders from Workato Research Lab, ISMG, and Hippocratic AI share the decisions, tradeoffs, and investments that got them to production AI at scale.
Kari Briski, VP Gen AI, NVIDIA, and Salman Paracha, SVP AI, DigitalOcean discuss why AI-native teams are demanding openness, model flexibility, and infrastructure built for agents that never sleep — and how the convergence of NVIDIA's software layer and DigitalOcean's platform is designed to meet exactly that moment.
Your model isn't failing because it can't reason. It's failing because it doesn't have the right information at the right time. In this session, we'll dig into the data layer: the part of your AI stack most teams treat as an afterthought and then scramble to fix in production.
Everyone has access to the same models. So what actually matters? It's everything around them – routing requests to the right model, connecting to live data, scaling from prototype to production without ripping your code apart. We'll walk through the full journey: from a single API call to intelligent routing across GPU fleets, showing what it looks like when the platform owns the stack end to end. Live demos included.
Early-stage founders building real AI companies today—what they’re solving, what’s getting in the way, and how they’re pushing through it.
Leading investors break down the economics of scaling AI in production, from infrastructure bottlenecks to open vs. closed ecosystems, and share their predictions for what the AI industry will look like in five years.
Deploy 2026 was hosted in person at Convene 100 Stockton, 40 O'Farrell St, San Francisco. The mainstage keynote was streamed live to registrants.
Deploy is designed for teams responsible for managing or building AI workloads in production at scale.
Qualifying participants got $5,000 in promotional inference cloud credits when they deploy a qualifying AI workload. Terms Apply.
This is a special Deploy that represents an evolution of cloud infrastructure that will change the way companies with AI in production conceive of their businesses. DigitalOcean's vertically integrated agentic inference cloud delivers radically simple operations and predictable unit economics that will set AI-natives on a path to success and growth.
No. Deploy is free to attend.
Qualifying participants got $5,000 in promotional inference cloud credits when they deploy a qualifying AI workload. Terms Apply.
Yes. Deploy follows the DigitalOcean Community Code of Conduct.