Authors
RedHat
0
Robert Shaw
6
Director of Engineering, Red Hat
Clayton Coleman
6
Distinguished Engineer, Google
Carlos Costa
5
Distinguished Engineer, IBM
Pete Cheslock
2
AI Community Architect, Red Hat
Christopher Nuland
0
Principal Technical Marketing Manager for AI, Red Hat
Nili Guy
1
R&D Manager, AI Infrastructure, IBM
Etai Lev Ran
1
Cloud Architect, IBM
Vita Bortnikov
1
IBM Fellow, IBM
Maroon Ayoub
1
Research Scientist & Architect, IBM
Danny Harnik
1
Senior Technical Staff Member, IBM
Tyler Smith
1
Member of Technical Staff, Red Hat
Kellen Swain
1
Software Engineer, Google
Xining Wang
1
Senior Technical Expert, Alibaba Cloud
Hang Yin
1
Senior R&D Engineer, Alibaba Cloud
Kay Yan
1
Principal Software Engineer, DaoCloud
Kyle Bader
0
Chief Architect, Data and AI, Ceph at IBM
Tushar Gohad
0
Distinguished Engineer, Intel