学术论文: |
Selected Publications Journal Papers [1]H. Li*, Q. Tian, D. Xu, H. Zhao, Z. Xu. Alleviating Straggler Impacts for Data Parallel Deep Learning with Hybrid Parameter Update. Future Generation Computer Systems (FGCS), Feb. 2025.(中科院一区)DOI:10.1016/j.future.2025.107775 [2]H. Li, Z. Wang, H. Zhao, M. Zhang, X. Li, H. Xu. Convergence-aware Optimal Checkpointing for Exploratory Deep Learning Training Jobs. Future Generation Computer Systems (FGCS), Nov.Mar. 2025.(中科院一区)DOI:10.1016/j.future.2024.107597 [3]X. Wei, Z. Xu,H. Li*, J. Hao, H. Yue, C. Liu.Harnessing dynamic graph differential operators for efficient data-driven wind predictionGeoInformatica, Mar. 2025.(CCF-B期刊) DOI:10.1007/s10707-025-00542-2 [4]H. Li, H. Zhao, T. Sun, X. Li, H. Xu, K. Li. Interference-aware Opportunistic Job Placement for Shared Distributed Deep Learning Clusters. Journal of Parallel and Distributed Computing (JPDC), Jan. 2024. (CCF-B)DOI:10.1016/j.jpdc.2023.104776 [5]Z. Xu, X Wei, J Hao, J Han,H Li*, C L, Z Li, D Tian, N Zhang. DGFormer: A Physics-Guided Station Level Weather Forecasting Model with Dynamic Spatial-Temporal Graph Neural Network,GeoInformatica, Feb. 2024.(CCF-B)DOI:10.1007/s10707-024-00511-1 [6]Z. Xu, X, Wei, J. Hao, J. Li,H. Li*, Z. Ding, S. Li. HiRM: Hierarchical Resource Management for Earth System Models on Many-core Clusters. CCF Transactions on High Performance Computing (THPC). Jan. 2024.(CCF-C)DOI:10.1007/s42514-023-00176-6 [7]H. Li, J. Wu, Z. Jiang, X. Li, X. Wei. A Task Allocation Method for Stream Processing with Recovery Latency Guarantee. Journal of Computer Science and technology(JCST), vol.33, no.6, pp.1125-1139, 2018.11.(CCF-B)DOI:10.1007/s11390-018-1876-6 [8]H. Li, J. Wu, Z. Jiang, X. Li, X. Wei. Minimum Backups for Stream Processing with Recovery Latency Guarantees.IEEE Transactions on Reliability, vol.66, no.99, pp.1-12. 2017.(中科院二区)DOI:10.1109/TR.2017.2712563 [9]X. Wei, L. Li, X. Li, X. Wang, S. Gao.H. Li. Pec: Proactive Elastic Collaborative Resource Scheduling in Data Stream Processing. IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 30. No. 7, pp. 1628-1642, July 1 2019.(CCF-A)DOI:10.1109/TPDS.2019.2891587 [10]X. Wei, Z. Xu,H. Li, Z. Ding. Coordinated process scheduling algorithms for coupled earth system models. Concurrency and Computation: Practice and Experience(CCPE), e6346, Oct 25, 2021.(CCF-C) [11]Y. Zhuang, X. Wei,H. Li, Y. Wang, X. He. An optimal checkpointing model with online OCI adjustment for stream processing applications. Concurrency and Computation: Practice and Experience(CCPE). June 10, 2019.(CCF-C)DOI:10.1002/cpe.5347 [12]X. Wei, Y. Zhuang,H. Li, Z. Liu. Reliable stream data processing for elastic distributed stream processing systems. Cluster Computing. May 21, 2019.(中科院三区)DOI:10.1007/s10586-019-02939-9 [13]W. Wei, X. Wei,H. Li. Topology-aware Task Allocation for Online Distributed Stream Processing Applications with Latency Constraints. Physica A: Statistical Mechanics and its Applications. Vol. 534, Nov. 15, 2019.(中科院二区)DOI:10.1016/j.physa.2019.122024 [14]X. Wei,H. Li, K. Yang, L. Zou. Topology-aware Partial Virtual Cluster Mapping Algorithm on Shared Distributed Infrastructures. IEEE Transactions of Parallel and Distributed Systems (TPDS), vol.25, no.10, pp.2721-2730, October 2014.(CCF-A)DOI:10.1109/TPDS.2013.224 [15]H. Li, X. Wei, Q. Fu, Y. Luo. MapReduce Delay Scheduling with Deadline Constraint. Concurrency and Computation: Practice and Experience(CCPE), vol.26, no.3, pp.766-778, March 10, 2014.(CCF-C)DOI:10.1002/cpe.3050 [16]X. Wei, Y. Jin,H. Li, X. Wang and S. Hu. Virtual Resource Consolidation for Green Computing Based on Virtual Cluster Live Migration. Journal of Communications, vol.11, no.2, pp.192-202, February 2016. DOI:10.12720/jcm.11.2.192-202 [17]X. Wei, W. Li, H. Tian,H. Li, H. Xu, T. Xu. THC-MP: High Performance Numerical Simulation of Reactive Transport and Multiphase Flow in Porous Media. Computers & Geosciences, vol. 80, pp.26-37, 2015. [18]X. Wei, X. Bai, S. Bai andH. Li. On-demand Tile Preload for Large-scale Seismic Data 3D-visualization. Journal of Computational Information Systems, vol.11, no.4, pp.1513-1520, February 2015. DOI:10.12733/jcis13585 [19]X. Wei, S. Hu,H. Li, F. Yang, Y. Jin. A survey on virtual network embedding in cloud computing centers. Open Automation and Control Systems Journal, vol.6, no.1, pp.414-425, 2014. [20]X. Wei,H. Li, L. Hu, Q. Guo, N. Jiang. LimeVI: A Platform for Virtual Cluster Live Migration over WAN. International Journal of Computer Systems Science and Engineering (CSSE), vol.26, No.5, pp.353-364, September 2011. Conference Papers [1]H. Zhao,H. Li*, Q. Tian, Z. Chen. FlexPipe: Maximizing the Training Efficiency for Transformer-based models with Variable-Length Inputs. USENIX Annual Technical Conference (USENIX ATC2025), Jul. 07-09, 2025.(CCF-A) [2]H. Zhao,H. Li*, Q. Tian, J. Wu, M. Zhang, Z. Xu, X. Li, H. Xu. ArrayPipe: Introducing Job-Array Pipeline Parallelism for High Throughput Model Exploration. IEEE International Conference on Computer Communications (INFOCOM2025), May, 2025.(CCF-A) [3]H. Li, H. Zhao, Z. Xu, X. Li, and H. Xu. ExplSched: Maximizing Deep Learning Cluster Efficiency for Exploratory Jobs. IEEE International Conference on Cluster Computing (CLUSTER2023), Oct. 31, 2023, Santa Fe, New Mexico, USA.(CCF-B) [4]H. Zhao, X. Li,H. Li*. Visage: Visual-Aware Generation of Adversarial Examples in Black-Box for Text Classification. The 13th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2024), Nov. 1, 2024, Hangzhou, China.(theBestPaperAward)(CCF-C) [5]H. Li, D. Xu, Z. Xu, X. Li. Hybrid Parameter Update: Alleviating Imbalance Impacts for Distributed Deep Learning. 24th IEEE International Conference on High Performance Computing and Communications (HPCC2022), Dec. 2022.(CCF-C) [6]H. Li, T. Sun, X. Li, H. Xu. Job Placement Strategy with Opportunistic Resource Sharing for Distributed Deep Learning Clusters. 2020 IEEE 22nd International Conference on High Performance Computing and Communications (HPCC2020), Dec. 2020.(CCF-C) [7]H. Li, Z Xu, F. Tang, X. Wei, Z. Ding. CPSA: A Coordinated Process Scheduling Algorithm for Coupled Earth System Model. 2020 29th International Conference on Computer Communication and Networks (ICCCN), August 2020.(CCF-C) [8]Y. Zhuang, X. Wei,H. Li*, M. Hou, Y. Wang. Reducing Fault-tolerant Overhead for Distributed Stream Processing with Approximate Backup. 2020 29th International Conference on Computer Communication and Networks (ICCCN), August 2020.(CCF-C) [9]Y. Zhuang, X. Wei,H. Li*, Y. Wang and X. He. An Optimal Checkpointing Model with Online OCI Adjustment for Stream Processing Applications. 2018 27th International Conference on Computer Communication and Networks (ICCCN), pp. 1-9, July 30 2018, Hangzhou, China.(CCF-C)DOI:10.1109/ICCCN.2018.8487327 [10]H. Li, J. Wu, Z. Jiang, X. Li, X. Wei. Task Allocation for Stream Processing with Recovery Latency Guarantee. in Cluster Computing (CLUSTER), 2017 IEEE International Conference on. IEEE, 2017, pp. 379–383.(CCF-B)DOI:10.1109/CLUSTER.2017.10 [11]H. Li, J. Wu, Z. Jiang, X. Li, X. Wei, Y. Zhuang. Integrated Recovery and Task Allocation for Stream Processing. 2017 IEEE 36th International Performance Computing and Communications Conference (IPCCC), Dec. 10, 2017, San Diego, CA, USA.(CCF-C)DOI:10.1109/PCCC.2017.8280443 |