Cisco Simplifies AI Deployments with New UCS Server and AI Pods

October 30, 2024

Cisco has recently announced two transformative innovations aimed at simplifying and enhancing AI infrastructure deployments: the UCS C885A M8 server and preconfigured infrastructure stacks known as AI Pods. This announcement addresses the increasing demands of organizations dealing with complex AI workloads by streamlining the deployment process. As enterprises continue to integrate artificial intelligence into their operations, the need for more efficient, reliable, and scalable infrastructure becomes paramount. Cisco’s latest offerings are designed to meet these needs head-on, promising a holistic approach to AI infrastructure that can significantly reduce the complexity and cost associated with such deployments.

By incorporating advanced features and capabilities, the UCS C885A M8 server and AI Pods offer a comprehensive solution for organizations looking to leverage AI for diverse applications, ranging from large-scale data analyses to real-time inferencing at the edge. These innovations also reflect the growing strategic partnership between Cisco and Nvidia, combining their expertise to deliver cutting-edge solutions that streamline AI operations and boost performance. This partnership not only enhances the robustness of Cisco’s AI infrastructure solutions but also helps enterprises harness the full potential of AI technologies more effectively.

Introducing the UCS C885A M8 Server

At the core of these advancements is the UCS C885A M8, an 8U rack server built on the Nvidia HGX platform. The server is specifically designed to handle intensive AI tasks such as training large language models (LLMs), model fine-tuning, and retrieval-augmented generation (RAG). Equipped with up to eight high-density Nvidia H100 or H200 Tensor Core GPUs, or AMD MI300X OAM GPUs, the UCS C885A M8 provides powerful acceleration for AI workloads. Its versatile architecture ensures that it can meet the performance demands of a wide range of AI applications, making it an ideal choice for enterprises looking to scale their AI operations efficiently.
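For teams sizing such a node, a quick sanity check is often the first step before launching a training or fine-tuning job. The following minimal sketch (assuming a PyTorch build with CUDA or ROCm support is installed; it is not tied to any Cisco tooling) simply enumerates the accelerators the framework can see on an eight-GPU host.

```python
# Minimal sketch: confirm that all accelerators in an 8-GPU node
# (such as the UCS C885A M8) are visible to the training framework.
# Works for both Nvidia (CUDA) and AMD (ROCm) builds of PyTorch,
# since ROCm reuses the torch.cuda API.
import torch

def report_accelerators() -> None:
    count = torch.cuda.device_count()
    print(f"Visible GPUs: {count}")
    for index in range(count):
        props = torch.cuda.get_device_properties(index)
        print(f"  cuda:{index} -> {props.name}, {props.total_memory / 2**30:.0f} GiB")

if __name__ == "__main__":
    report_accelerators()
```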

Cisco’s collaboration with Nvidia has deepened as they now offer integrated software and hardware solutions that enhance the robustness of AI infrastructures. The inclusion of Nvidia BlueField-3 DPUs in the server facilitates seamless GPU data access across dense GPU server clusters, boosting overall AI networking performance. This level of integration is crucial for enterprises that rely on swift data processing and real-time analytics, as it ensures minimal latency and maximizes throughput. The UCS C885A M8 server represents a significant leap forward in AI infrastructure, providing a solid foundation for future advancements and supporting the ever-growing computational needs of AI-driven enterprises.
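To illustrate how a dense GPU cluster like this is typically consumed by training software, the sketch below initializes an NCCL process group and runs a tiny all-reduce to verify connectivity across the fabric. The interface and HCA names are assumptions that depend on how the network is presented to the host (for example, through BlueField-3 DPU ports), and RANK, WORLD_SIZE, MASTER_ADDR, and LOCAL_RANK are expected to be injected by a launcher such as torchrun; treat this as an illustrative sketch, not Cisco-specific configuration.

```python
# Illustrative sketch of bringing up NCCL-based distributed training
# across a GPU cluster. Launch with torchrun so that RANK, WORLD_SIZE,
# MASTER_ADDR, and LOCAL_RANK are set for each process.
import os
import torch
import torch.distributed as dist

# Steer NCCL traffic onto the high-speed fabric rather than the
# management NIC. The names below are placeholders that depend on
# how the cluster's network adapters appear on the host.
os.environ.setdefault("NCCL_SOCKET_IFNAME", "eth1")  # placeholder interface
os.environ.setdefault("NCCL_IB_HCA", "mlx5")         # placeholder HCA prefix

dist.init_process_group(backend="nccl")
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

# A tiny all-reduce verifies that every rank can communicate over the fabric.
probe = torch.ones(1, device="cuda")
dist.all_reduce(probe)
if dist.get_rank() == 0:
    print(f"all_reduce across {dist.get_world_size()} ranks ->", probe.item())
dist.destroy_process_group()
```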

A standout feature of the new server is its management through Cisco Intersight, Cisco’s cloud-based (SaaS) operations platform. Intersight allows administrators to oversee Kubernetes containers, applications, servers, and hyperconverged environments from a single interface. This unified approach simplifies the complexity that enterprises often face when managing diverse computing resources, helping organizations maintain optimal performance, enhance security, and reduce operational costs. The capability is especially beneficial for enterprises with extensive AI deployments, as it ensures smooth and efficient operations across different environments.

The UCS C885A M8 also complements Cisco’s existing M7 generation of UCS rack and blade servers, such as the UCS X-Series, which integrate Nvidia’s Tensor Core GPUs. Those systems cater to AI and data-heavy workloads in both data centers and edge environments, making them versatile options for various organizational needs. Together, these technologies underscore Cisco’s commitment to delivering comprehensive AI solutions that address the diverse requirements of modern enterprises. By combining cutting-edge hardware with advanced software capabilities, Cisco’s AI infrastructure offerings provide the reliability, scalability, and flexibility that organizations need to stay competitive in an increasingly AI-driven world.

Enhanced Management and Integration

A crucial aspect of Cisco’s new AI infrastructure offerings is the enhanced management and integration provided by Cisco Intersight. Beyond basic monitoring, the platform ties Kubernetes containers, applications, servers, and hyperconverged environments into a single operational workflow, reducing the administrative burden on IT teams. This level of integration is vital for organizations with extensive AI infrastructures, as it ensures efficient resource utilization, minimizes downtime, and enhances overall system performance.

Intersight’s unified management approach addresses one of the most significant challenges enterprises face when deploying AI at scale: operational complexity. Centralizing monitoring, management, and optimization in one place helps enterprises maintain performance and security as AI deployments become more widespread and complex. Intersight’s ability to manage both data center and edge environments also makes it a versatile tool for organizations looking to apply AI across different use cases and scenarios.
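As a concrete illustration, the sketch below uses the publicly available intersight Python SDK (pip install intersight) to list the physical servers an Intersight account can see. The API key ID and private-key path are placeholders, and the exact signing parameters can vary with the SDK version and key type, so treat this as an illustrative pattern rather than a definitive integration.

```python
# Sketch: list physical servers known to Cisco Intersight via its REST API.
# The key ID and key-file path are placeholders; depending on whether the
# API key is RSA- or ECDSA-based, explicit signing_algorithm/hash_algorithm
# settings may also be required (see the SDK documentation).
import intersight
import intersight.signing
from intersight.api import compute_api

API_KEY_ID = "your-api-key-id"          # placeholder
PRIVATE_KEY_FILE = "./intersight.pem"   # placeholder

configuration = intersight.Configuration(
    host="https://intersight.com",
    signing_info=intersight.signing.HttpSigningConfiguration(
        key_id=API_KEY_ID,
        private_key_path=PRIVATE_KEY_FILE,
        signing_scheme=intersight.signing.SCHEME_HS2019,
        signed_headers=[
            intersight.signing.HEADER_REQUEST_TARGET,
            intersight.signing.HEADER_DATE,
            intersight.signing.HEADER_DIGEST,
            intersight.signing.HEADER_HOST,
        ],
    ),
)

with intersight.ApiClient(configuration) as client:
    compute = compute_api.ComputeApi(client)
    # Fetch the first ten physical servers in the inventory.
    servers = compute.get_compute_physical_summary_list(top=10)
    for server in servers.results:
        print(server.name, server.model, server.oper_state)
```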

Complementing the UCS C885A M8 server, Cisco introduced the Nexus 9364E-SG2 switch, a high-density 800G aggregation switch designed to support the immense data flow generated during AI model training. The switch supports port speeds ranging from 100 Gbps to 400 Gbps, ensuring high bandwidth and minimal latency for AI operations. Support for Octal Small Form Factor Pluggable (OSFP) and Quad Small Form Factor Pluggable Double Density (QSFP-DD) optics facilitates high-speed optical network connections, making the Nexus 9364E-SG2 an integral part of Cisco’s AI infrastructure solution. The switch is engineered to handle the demanding requirements of AI workloads, providing the performance and reliability needed to support large-scale AI deployments.

The Nexus 9364E-SG2 switch also addresses the critical need for advanced congestion management and large buffer sizes, essential for maintaining low latency and high performance during AI operations. These capabilities ensure that AI model training processes are efficient and effective, reducing the risk of bottlenecks and performance degradation. By integrating the Nexus 9364E-SG2 switch with the UCS C885A M8 server, Cisco offers a comprehensive AI infrastructure solution that can meet the diverse needs of modern enterprises. This combination of powerful hardware and advanced management tools provides a solid foundation for organizations looking to harness the full potential of AI technologies.

Simplifying Deployment with AI Pods

To further simplify AI infrastructure deployment, Cisco has introduced AI Pods. These preconfigured and validated infrastructure packages are designed for easy integration into data center or edge environments, offering a straightforward solution for enterprises looking to implement AI without the usual complexities. Based on Cisco Validated Design principles, AI Pods provide a blueprint for building reliable, scalable, and secure network infrastructures, ensuring predictable performance and outcomes. These pre-sized bundles eliminate the guesswork involved in deploying edge inferencing and large-scale AI clusters, making it easier for organizations to achieve their AI goals.

AI Pods come equipped with Nvidia AI Enterprise, offering pretrained models and development tools that are ready for production. Managed through Cisco Intersight, these Pods provide an end-to-end managed solution for AI inferencing, streamlining the deployment process and reducing the time to production. This level of integration and validation ensures that AI Pods can meet the diverse needs of enterprises, from training sophisticated AI models to deploying them for real-world inferencing tasks. By providing a ready-made solution, AI Pods simplify the AI deployment process, making it more accessible and cost-effective for organizations of all sizes.
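To give a rough sense of what “ready for production inferencing” looks like in practice, the sketch below posts a chat request to a model served from an AI Pod. Nvidia AI Enterprise’s NIM microservices expose an OpenAI-compatible HTTP API; the endpoint URL and model name here are placeholders for whatever is actually deployed in a given environment.

```python
# Sketch: send an inference request to a model served inside an AI Pod.
# Assumes a NIM-style, OpenAI-compatible endpoint; the URL and model name
# are placeholders, not values tied to any specific deployment.
import requests

NIM_ENDPOINT = "http://ai-pod.example.internal:8000/v1/chat/completions"  # placeholder
MODEL_NAME = "meta/llama-3.1-8b-instruct"                                  # placeholder

payload = {
    "model": MODEL_NAME,
    "messages": [
        {"role": "user", "content": "Summarize last quarter's support tickets."}
    ],
    "max_tokens": 256,
    "temperature": 0.2,
}

response = requests.post(NIM_ENDPOINT, json=payload, timeout=60)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```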

These advancements address critical infrastructure needs that many enterprises face today. A significant barrier to AI adoption has been the necessity for complete infrastructure overhauls, the development of new skills, and additional costs for security and management. According to the Cisco AI Readiness Index, only 14% of organizations currently have infrastructures ready for AI, with 85% of projects stalling or facing disruptions. Cisco’s new offerings aim to lower these barriers by providing integrated hardware-software packages and streamlined management solutions, making AI infrastructure more predictable and cost-effective.

By simplifying AI deployment with solutions like the UCS C885A M8 server and AI Pods, Cisco is helping enterprises overcome the challenges of AI adoption. These offerings provide the tools and resources necessary to implement AI efficiently, ensuring that organizations can leverage the benefits of AI without the usual complexities and costs. The UCS C885A M8 server and AI Pods are positioned to be crucial tools in helping organizations achieve their AI goals, providing a solid foundation for future innovation and growth. With these new solutions, Cisco is paving the way for more widespread AI adoption in enterprise environments.

Addressing Enterprise Challenges

Taken together, the UCS C885A M8 server and the preconfigured AI Pods are a direct response to the growing complexities organizations face when managing AI workloads. As businesses increasingly adopt artificial intelligence, there is a pressing need for infrastructure that is efficient, reliable, and scalable, and Cisco’s new offerings are tailored to address these needs while cutting down on associated costs and complexity.

Both offerings support a broad range of AI applications, from extensive data analysis to edge-based real-time inferencing, and they highlight Cisco’s deepening alliance with Nvidia. By combining Cisco’s infrastructure and management portfolio with Nvidia’s accelerated computing and software stack, the two companies aim to streamline AI operations, deliver superior performance, and make it easier for enterprises to move their AI initiatives into production.
