
Building an AI-Ready Enterprise Cloud with OpenNebula


Carlos Moral

Senior Technologist at OpenNebula Systems

Feb 13, 2025

One of our main use cases is enabling the growing number of companies looking to build private and hybrid cloud infrastructures for running AI training and inference services. Compared to relying on public cloud providers, deploying a private cloud for AI brings several key benefits:

  • Cost Efficiency: Owning and managing infrastructure significantly reduces long-term costs compared to public cloud services.
  • Data Privacy and Security: Full control over data ensures regulatory compliance and eliminates concerns about third-party access.
  • Customizability: Tailor the environment to optimize AI frameworks, libraries, and tools for maximum efficiency.
  • Vendor Neutrality: Avoid vendor lock-in by leveraging open source solutions and diverse hardware options.
  • Performance Optimization: Fine-tune infrastructure to meet workload-specific requirements.
  • Reduced Latency: Locally hosted infrastructure minimizes latency for real-time AI inference and data processing.

OpenNebula: Powering Multi-Tenant AI Factories

To support this vision, OpenNebula offers key features for multi-tenant AI cloud environments, including:

  • Robust Multi-Tenancy: Secure resource sharing across teams while efficiently leveraging GPU acceleration.
  • High-Performance Hardware Access: Support for SR-IOV and PCI passthrough enables direct hardware access, optimizing GPU-intensive workloads.
  • True As-a-Service Model: Seamlessly manage on-premise and multi-cloud hybrid AI deployments.
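In practice, direct GPU access is requested per VM through a PCI clause in the VM template. A minimal sketch of such a fragment follows; the vendor/device IDs shown here are examples (an NVIDIA data-center GPU) and must match the devices actually reported by host monitoring on your cloud:

```
# Illustrative OpenNebula VM template fragment requesting GPU passthrough.
# VENDOR/DEVICE/CLASS must match a device listed by "onehost show" for the host.
PCI = [
  VENDOR = "10de",   # NVIDIA (example vendor ID)
  DEVICE = "1db6",   # example device ID; use your GPU's actual ID
  CLASS  = "0302"    # 3D controller
]
```

When the VM is scheduled, OpenNebula places it on a host exposing a matching free device and attaches it directly to the guest.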

New AI Integration: Deploying LLMs on OpenNebula

Over the past months, we have been working to simplify the deployment and orchestration of Large Language Models (LLMs) from Hugging Face within OpenNebula-powered infrastructure.

As part of OneApps 6.10.0-3, we have released the Ray Appliance, designed for managed AI inference and LLM applications. This integration with Hugging Face allows users to:

  • Easily deploy AI applications within their cloud infrastructure.
  • Leverage OpenNebula’s scalability and automation for AI workloads.

The appliance has been tested with several LLMs, including Llama from Meta, Qwen from Alibaba, Mistral from Mistral AI, EuroLLM, ALIA, and many others.

Check out this new screencast showing how to automatically deploy LLMs from Hugging Face on an OpenNebula-powered cloud using the Ray Appliance from the OpenNebula App Marketplace.
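For reference, a Marketplace appliance is typically pulled and launched from the OpenNebula CLI along these lines; the appliance name, template name, and datastore below are illustrative placeholders and depend on your cloud:

```shell
# Download the Ray Appliance from the OpenNebula Marketplace into a local
# image datastore (names are placeholders; list apps with "onemarketapp list")
onemarketapp export "Service Ray" ray-llm --datastore default

# Instantiate the resulting VM template; the Hugging Face model to serve
# is supplied through the template's contextualization parameters
onetemplate instantiate ray-llm
```

The same flow is available through the Sunstone web UI, as shown in the screencast.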

This is the initial version, and we are actively enhancing the appliance to support not just LLMs but also other types of ML models, including classification, sentiment analysis, time-series forecasting, and more. Additionally, we are working on clear guidelines and examples for training or fine-tuning other models, as well as instructions on how to deploy and use your own models through this appliance.

Join Our Webinar: Empowering AI in the Cloud

We’re hosting a webinar where we will:

  • Present our first Hugging Face integration
  • Demonstrate the Ray Appliance in action
  • Answer live questions from the community

Meet Us at MWC25: Build Your AI-Ready Telco Infrastructure

We will also be running an “AI-Ready Booth: Build Your AI-Ready Telco Infrastructure for the Future” at Mobile World Congress 2025 in Barcelona. Secure your one-on-one meeting!

What’s Next?

This is just the beginning! We are continuously enhancing our AI capabilities, adding new features, integrations, and partnerships to bring AI to your data center. In particular, the next version of the AI appliance will include:

  • Support for training, fine-tuning, and deploying models seamlessly as part of AI workflows
  • Support for serving models with vLLM
  • Support for exposing inference endpoints through an OpenAI-compatible API
  • Support for using multiple GPUs/vGPUs
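An OpenAI-compatible endpoint means off-the-shelf clients and SDKs can talk to the appliance without modification. A minimal sketch of building such a chat-completion request with only the Python standard library; the endpoint URL and model name are hypothetical placeholders:

```python
import json
import urllib.request

def build_chat_request(endpoint, model, prompt, max_tokens=128):
    """Build an OpenAI-style /v1/chat/completions request for a
    self-hosted inference endpoint (endpoint and model are placeholders)."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        f"{endpoint}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request(
    "http://ray-appliance.example:8000",     # hypothetical appliance VM
    "meta-llama/Llama-3.1-8B-Instruct",      # any Hugging Face model ID
    "Summarize what OpenNebula does.",
)
# Sending it against a live appliance is a plain urllib.request.urlopen(req).
print(req.full_url)
```

Because the wire format matches OpenAI's, swapping a public API for a private deployment is just a change of base URL.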

Stay tuned—exciting times ahead!



Funded by the Spanish Ministry for Digital Transformation and Civil Service through the ONEnextgen Project  (UNICO IPCEI-2023-003), and co-funded by the European Union’s NextGenerationEU through the RRF.

