Supporting very large models using automatic dataflow graph partitioning M Wang, C Huang, J Li Proceedings of the Fourteenth EuroSys Conference 2019, 1-17, 2019 | 175 | 2019 |
SwapAdvisor: Pushing Deep Learning Beyond the GPU Memory Limit via Smart Swapping CC Huang, G Jin, J Li Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020 | 174 | 2020 |
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel Y Zhao, A Gu, R Varma, L Luo, CC Huang, M Xu, L Wright, H Shojanazeri, ... arXiv preprint arXiv:2304.11277, 2023 | 146 | 2023 |
Spartan: A distributed array framework with smart tiling CC Huang, Q Chen, Z Wang, R Power, J Ortiz, J Li, Z Xiao 2015 {USENIX} Annual Technical Conference ({USENIX}{ATC} 15), 1-15, 2015 | 37 | 2015 |
Unifying Data, Model and Hybrid Parallelism in Deep Learning via Tensor Tiling M Wang, C Huang, J Li arXiv preprint arXiv:1805.04170, 2018 | 28 | 2018 |
Enhancing microkernel performance on VLIW DSP processors via multiset context switch KY Hsieh, YC Lin, CC Huang, JK Lee Journal of Signal Processing Systems 51 (3), 257-268, 2008 | 21 | 2008 |
Garbage collection for multiversion index in flash-based embedded databases PC Huang, YH Chang, KY Lam, JT Wang, CC Huang ACM Transactions on Design Automation of Electronic Systems (TODAES) 19 (3 …, 2014 | 13 | 2014 |
Integrating compiler and system toolkit flow for embedded VLIW DSP processors C Wu, KY Hsieh, YC Lin, CJ Wu, WL Shih, SC Chen, CK Chen, CC Huang, ... 12th IEEE International Conference on Embedded and Real-Time Computing …, 2006 | 9 | 2006 |
Get More With Less: Near Real-Time Image Clustering on Mobile Phones J Ortiz, CC Huang, S Chakraborty arXiv preprint arXiv:1512.02972, 2015 | 3 | 2015 |