This repository has been archived by the owner on Oct 19, 2024. It is now read-only.
Maintenance release.
- Support manual sharding (#816)
- Optimize cross-mesh communication (#773, #798)
- Add publications (#885)
- Support new models: CodeGen, BLOOM-Z (#774, #844)
- Add priority scheduling in the model server (#852)
- Add guidance on strategy inspection (#879)
- Inference stage construction (#793, #799)
- Misc bug fixes (#876, #878, #873)