Ziming Zheng

 Ph.D. candidate
Department of Computer Science
Illinois Institute of Technology
E-mail:
zzheng11[at] iit [dot] edu


Education

I am a Ph.D. candidate, working with Dr. Zhiling Lan in the Scalable Computing Software Lab of Computer Science Department at Illinois Institute of Technology.

I have achieved my bachelor degree in Computer Science, University of Electronic Science and Technology of China in July 2003.


Research Areas

 

  • Discovering and forecasting incipient faults in large-scale systems.
  • Fast failure recovery.

 


 

Publications

 

  • Z. Zheng and Z. Lan, "Log Analysis for Fault Management in HPC systems," Proc. of DSN (research poster), 2011.
  • L. Yu, Z. Zheng, Z. Lan, and S. Coghlan, "Practical Online Failure Prediction for Blue Gene/P: Period-based vs Event-driven," Proactive Failure Avoidance, Recovery, and Maintenance workshop (in conjunction with  DSN’11), 2011.
  • Z. Zheng, L. Yu, W. Tang, Z. Lan, R. Gupta, N. Desai, S. Coghlan, and D. Buettner, ''Co-Analysis of RAS Log and Job Log on Blue Gene/P,'' Proc. of IPDPS, 2011.
  • Z. Zheng, Z. Lan, R. Gupta, S. Coghlan and Peter Beckman, "A Practical Failure Prediction with Location and Lead Time for Blue Gene/P, " Fault-Tolerance at Extreme Scale workshop (in conjunction with  DSN’10), 2010.
  • Z. Lan, J. Gu, Z. Zheng, R. Thakur, and S. Coghlan, "A Study of Dynamic Meta-Learning for Failure Prediction in Large-Scale Systems," Journal of Parallel and Distributed Computing (JPDC), 2010.
  • Z. Lan, Z. Zheng, and Y. Li, "Toward Automated Anomaly Identification in Large-Scale Systems," IEEE Trans. on Parallel and Distributed Systems, vol. 21, no. 2, pp. 174-187, 2010.
  • Z. Zheng, R. Gupta, Z. Lan and S. Coghlan, "FTB-enabled Failure Prediction for Blue Gene/P Systems," Proc. of ACM/IEEE SuperComputing (research poster), 2009.
  • Z. Zheng and Z. Lan. "Reliability-Aware Scalability Models for High Performance Computing," Proc. of IEEE Cluster'09, Aug. 2009.
  • Z. Zheng, Z. Lan, B-H. Park, and A. Geist, "System Log Pre-processing to Improve Failure Prediction," Proc. of DSN'09, 2009.
  • H. Jin, X.-H. Sun, Z. Zheng, Z. Lan, and B. Xie, "Performance under Failures of DAG-based Parallel Computing," Proc. of CCGrid09, 2009.
  • J. Gu, Z. Zheng, Z. Lan, J. White, E. Hocks, and B-H. Park, "Dynamic Meta-Learning for Failure Prediction in Large-scale Systems: A Case Study," Proc. of ICPP08, 2008.
  • B-H. Park, Z.Zheng, Z. Lan and A. Geist, "Analyzing Failure Events on ORNL’s Cray XT4," Proc. of ACM/IEEE SuperComputing (research poster) , 2008.
  • Z. Lan, Y. Li, Z. Zheng, and P. Gujrati, "Enhancing Application Robustness through Adaptive Fault Tolerance," Proc. of the NSFNGS workshop (in conjunction with IPDPS'08), 2008.
  • X. Sun, Z. Lan, Y. Li, H. Jin, and Z. Zheng, "Towards a Fault-Aware Computing Environment," Proc. of High Availability and Performance Computing Workshop, 2008.
  • Z. Zheng, Y. Li, and Z. Lan, "Anomaly Localization in Large-scale Clusters," Proc. of IEEE Cluster'07, 2007.
  • Z. Lan, Y. Li, P. Gujrati, Z. Zheng, R. Thakur, and J. White, "A Fault Diagnosis and Prognosis Service for TeraGrid Clusters," Proc. of TeraGrid'07 , 2007.

Invited Talk

  • Workshop on Resiliency for Petascale HPC,  held in conjunction with the Los Alamos Computer Science Symposium

 


Award

 


 

Teaching Assistant

 


Internship


Activities

 

  • Student volunteer of the TeraGrid '07 conference, June 4-8, 2007, Madison, WI
  • Student volunteer of the SC '07 conference, Nov 10-16, 2007, Reno, NV
  • Student volunteer of the SC '08 conference, Nov 15-21, 2008, Austin, TX

Hobbies

  • Outdoor sports, e.g. hiking, camping & rock climbing
  • Badmintons, soccer & swimming
  • Literature, painting, photograph

 


Research links

 

Estimated impact of publication venues in Computer Science
Most Cited Conferences, Journals, Authors and Papers
Computer Science Journal Rankings
Computer Science Conference Statistics
Networking conference dates