
- Efficient and optimized all-to-all communication
- Both intranode and internode support with NVLink and RDMA
- High-throughput kernels for training and inference prefilling
- Low-latency kernels for inference decoding
- Native FP8 dispatch support
- Flexible GPU resource control for computation-communication overlapping

X: https://x.com/deepseek_ai/status/1894211757604049133

