How Many Users Can Your LLM Server Really Handle?
Deploying large language models (LLMs) in an enterprise environment has transitioned from a proof-of-concept exercise to a rigorous engineering discipline. Yet, accurately predicting the capacity...
Enrique Corro is a data science engineer with nearly 20 years of experience bridging enterprise infrastructure and machine learning innovation. With a master's degree in data science from the University of Illinois and advanced certifications from Stanford's Artificial Intelligence Professional Program, he specializes in deploying production-grade ML systems and driving AI adoption at scale. Currently pursuing Johns Hopkins University's Certificate in Agentic AI, Enrique stays at the forefront of emerging technologies including LLM fine-tuning, agentic systems, and distributed training. His work focuses on making sophisticated ML capabilities—from model optimization to multi-GPU training—accessible and practical for enterprise deployments. At VMware by Broadcom's Cloud Foundation division, he helps organizations leverage advanced machine learning while advancing AI integration across products and operations.
Deploying large language models (LLMs) in an enterprise environment has transitioned from a proof-of-concept exercise to a rigorous engineering discipline. Yet, accurately predicting the capacity...