
| 
   
  | 
  ||
| 
   黄宜华 博士, 教授, 博导  | 
  
   Yihua
  Huang, Ph.D., Professor  | 
 |
| 
   Department
  of Computer Science & Technology Nanjing University   | 
 ||
| 
   Big Data Expert Committee, China Computer Federation (CCF)  | 
  
   Vice Chair  | 
 |
| 
   Big Data Expert Committee, Jiangsu Computer Society  | 
  
   Chair  | 
 |
| 
   Jiangsu Digital Economy Society  | 
  
   Vice President  | 
 |
| 
   | 
 |||||||||
| 
   联系信息 Contact Information  | 
 |||||||||
| 
   邮件:  | 
  
   黄宜华  | 
  
   Mail:  | 
  
   Yihua
  Huang  | 
 ||||||
| 
      | 
  
   南京大学计算机科学与技术系  | 
  
      | 
  
   Department
  of Computer Science & Technology, Nanjing University  | 
 ||||||
| 
   南京市栖霞区仙林大道163号  | 
  
   163
  Xianlin Road, Nanjing  | 
 ||||||||
| 
   中国南京 210023  | 
  
   Nanjing
  210023, China  | 
 ||||||||
| 
   办公室:  | 
  
   计算机系大楼408室  | 
  
   Office:  | 
  
   408,
  Computer Department Building  | 
 ||||||
| 
   南京大学仙林校区  | 
  
   Xianlin
  Campus of Nanjing University  | 
 ||||||||
| 
   电话:  | 
  
   025-8968-6517  | 
  
   Tel:  | 
  
   025-8968-6517  | 
 ||||||
| 
   邮箱:  | 
  
   Email:  | 
  ||||||||
| 
       主要研究兴趣                               Research
  Interest  | 
  
 |
| 
            大数据智能化分析应用  | 
  
  
   Analytic
  Applications for Big Data   | 
  
 
| 
            大数据分布并行处理  | 
  
  
   Big Data
  Distributed & Parallel Processing   | 
  
 
| 
            大数据机器学习算法与系统  | 
  
  
   Machine Learning
  Algorithms & Systems for Big Data  | 
  
 
| 
            文本语义分析  | 
  
  
   Text semantic
  analysis  | 
  
 
| 
           
  Web数据挖掘  | 
  
  
   Web Data Mining  | 
  
 
| 
   21.   视图公共安全应用体系建设科技示范  | 
  
  
   21. Applications for Public Safety for
  Large-scale Media Data   | 
  
 
| 
   江苏省科技厅重点项目课题(项目号BE2021729)  | 
  
  
   Jiangsu Province
  Science & Tech Research Program (BE2021729)
    | 
  
 
| 
   2021-2023,课题负责人  | 
  
  
         2021-2023,PI  | 
  
 
| 
   20.   中药分子标识研究及中药智慧云信息平台建设  | 
  
  
   20. Study on Herb Molecular Markers &
  Chinese Herb Cloud Information Platform  | 
  
 
| 
   国家重点研发计划项目(2019YFC1711000)  | 
  
  
     National Key R&D Program of China(2019YFC1711000)  | 
  
 
| 
   2020-2021,课题负责人  | 
  
  
     2020-2021,PI for
  sub-project  | 
  
 
| 
   19.  大数据计算的混合编程环境与大数据分析处理系统支撑平台  | 
  
  
   19. Hybrid Programming Environment &
  Platform for Big Data Analytics & Processing   | 
  
 
| 
   国家自然基金重点课题(项目号U181461)  | 
  
  
     China NSF Research Program(#U181461)  | 
  
 
| 
   2019-2022,课题负责人  | 
  
  
     2019-2022,PI  | 
  
 
| 
   18.  跨平台统一大数据分析处理与可视化编程系统平台
    | 
  
  
   18. Cross-platform Big Data Analytic &
  Processing & Visual Programming Platform  | 
  
 
| 
   江苏省科技厅重点项目(项目号BE2017155)  | 
  
  
     Jiangsu Province Science & Tech
  Research Program(#
  BE2017155)  | 
  
 
| 
   2017-2020,项目负责人  | 
  
  
     2017-2020,PI  | 
  
 
| 
   17.  大数据OLAP分析引擎及Flink实时计算技术  | 
  
  
   17. OLAP Analytic Engine & Flink
  Real-time Computation for Big Data  | 
  
 
| 
   华为合作项目,2020  | 
  
  
     Huawei, 2020  | 
  
 
| 
   16.  AutoML算法平台及其应用
    | 
  
  
   16. AutoML Algorithms,Platform
  & Applications  | 
  
 
| 
   华为合作项目,2018-2019  | 
  
  
   Huawei,2018-2019  | 
  
 
| 
   15.  证券行情数据回放系统与统一大数据分析平台  | 
  
  
   15. Securities Market Data Replay System
  & Unified Big Data Analytic Platform   | 
  
 
| 
        华泰证券,2017-2018  | 
  
  
     Huatai,2017-2018  | 
  
 
| 
   14.  基于Alluxio的多HDFS
  NameNode路由选择和热数据缓存       | 
  
  
   14. Alluxio-based Multi-HDFS NameNode
  Routing & Hot Data Caching   | 
  
 
| 
   苏宁云商,2017  | 
  
  
     Suning,2017  | 
  
 
| 
   13.  大规模文本分析与深度推荐算法与系统      | 
  
  
   13. Large Scale Text Analysis & Deep
  Recommendation Algorithm & System   | 
  
 
| 
   微软亚洲研究院合作项目,2016-2017,项目负责人    | 
  
  
     Microsoft Research Asia Program, 2016-2017, PI  | 
  
 
| 
   12.  大数据分层式存储系统缓存调度策略与框架研究     | 
  
  
   12. Cache Schedule Policy & Framework
  for Hierarchical Big Data Storage System   | 
  
 
| 
   Intel公司合作项目,2016-2017,项目负责人    | 
  
  
     Intel Research Program,2016-2017,PI  | 
  
 
| 
   11.  大数据机器学习与数据分析统一编程计算模型与关键技术研究    | 
  
  
   11. Unified Programming Model & Key
  Techniques for Big Data Machine Learning & Data Analysis  | 
  
 
| 
   国家自然基金面上项目 (项目号61572250)   | 
  
  
     China NSF Research Program (#61572250)  | 
  
 
| 
   2016-2019,项目负责人  | 
  
  
     2016-2019,PI  | 
  
 
| 
   10.
  大数据并行化分析计算统一编程框架与软件平台    | 
  
  
   10. Unified Programming Framework &
  Platform for Big Data Analysis  | 
  
 
| 
   江苏省科技支撑计划项目(项目号BE2014131)  | 
  
  
     Jiangsu Province Science & Tech.
  Support Program(BE2014131)  | 
  
 
| 
   2014-2017,项目负责人  | 
  
  
     2014-2017,PI  | 
  
 
| 
   9.
  大规模软件结构智能化分析算法与系统平台    | 
  
  
    
  9. Algorithms & Platform for Large Scale Software Structure
  Analysis  | 
  
 
| 
   华为公司合作项目,2015-2016,项目负责人    | 
  
  
     Huawei,2015-2016,PI  | 
  
 
| 
   8.
  Apache Alluxio优化与功能增强  | 
  
  
    
  8. Optimization and Enhancement for Apache Alluxio  | 
  
 
| 
   Apache Alluxio开源社区合作研究, 2014-现在  | 
  
  
     Apache Alluxio Open Source Research,
  2014-Present  | 
  
 
| 
   7.
  Apache Spark优化与功能增强  | 
  
  
    
  7. Optimization and Enhancement for Apache Spark  | 
  
 
| 
   Apache Spark开源社区合作研究, 2014-2015  | 
  
  
     Apache Spark Open Source Research,
  2014-2015  | 
  
 
| 
   6.
  面向大数据的媒体内容分析与关联语义挖掘研究  | 
  
  
    
  6. Research on Big Media Data Content Analysis and Associated Semantic
  Mining   | 
  
 
| 
   国家自然科学基金专项基金项目(项目号61223003)  | 
  
  
         China National
  Science Foundation Special Research Grant(#61223003)  | 
  
 
| 
          2013.1-2016.12,项目主要参与者   | 
  
  
     1/2013-12/2016, Co-PI   | 
  
 
| 
   5.
  Gradient Boosting决策树Spark并行化训练算法研究
    | 
  
  
    
  5. Gradient Boosting Decision Tree Parallel Training Algorithm with
  Spark  | 
  
 
| 
      百度主题研究项目, 2014,项目负责人  | 
  
  
         Baidu Research
  Project, 2014, PI  | 
  
 
| 
   4.
  HBase二级索引与查询技术研究  | 
  
  
    
  4. Secondary Index and Query for HBase   | 
  
 
| 
       中兴通讯,项目负责人,2013-2014  | 
  
  
         ZTE, China.
  2013-2014, PI  | 
  
 
| 
       3. 大规模中文文本语义分析与医疗文本挖掘  | 
  
  
    
  3. Large Scale Chinese Text Semantic Analysis and Medical Record
  Mining    | 
  
 
| 
           美国Intel Labs研究项目,  2013.4-2014.3,项目负责人  | 
  
  
         USA Intel Labs
  URO Funding, 4/2013-3/2014, PI  | 
  
 
| 
   2.
  面向复杂结构的精确Web信息抽取集成模型与关键技术研究  | 
  
  
    
  2. Research on Model and Techniques for Web Info Extraction &
  Integration   | 
  
 
| 
           国家自然科学基金面上项目(项目号61072152)  | 
  
  
     China National Science Foundation
  Research Grant(#61072152)   | 
  
 
| 
   2011.1-2013.12,项目负责人  | 
  
  
     1/2011-12/2013, PI  | 
  
 
| 
       1. 精确信息定制服务Web信息抽取集成通用引擎与服务软件平台  | 
  
  
    
  1. Accurate Web Info Extraction and Integration Engine and Service
  Platform  | 
  
 
| 
           江苏省科技支撑计划项目(项目号BE2011172)  | 
  
  
         Jiangsu
  Province Science & Technology Research Grant (#BE2011172)  | 
  
 
| 
          
  2011.4-2013.12,项目负责人  | 
  
  
         4/2011-12/2013,
  PI  | 
  
 
| 
   | 
  
  
   | 
  
 
| 
      
  Education and Work Experience
                        
      | 
 ||
| 
          2008-Present   Professor, Department of Computer Science & Technology, Nanjing University  | 
  
      | 
  |
| 
          2002-2008  Research Scientist, Center for Biotechnology and Genomic Medicine,
  Medical College of Georgia, USA  | 
  
      | 
  |
| 
         
  1998-2001  Visiting Scholar, Database Research Center,
  University of Florida, USA  | 
  
      | 
  |
| 
         
  1998-2001  Professor, Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
          1993-1997
   Associate Professor, Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
         
  1988-1993  Lecturer, Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
         
  1986-1988  Teaching Assistant, Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
         
  1994-1997  Ph.D., Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
         
  1983-1986  Graduate studies, Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
         
  1979-1983  Undergraduate studies, Department of Computer Science & Technology,
  Nanjing University  | 
  
      | 
  |
| 
      | 
  
      | 
 ||
| 
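   Courses taught:  | 
  
   Large-Scale Data Parallel Processing (undergraduate and graduate)  | 
  
   Courses previously offered:   | 
  
   Web Technology and Application Development  | 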
 
| 
      | 
  
      | 
  
   Principles of Computers  | 
 |
| 
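   Curriculum development:  | 
  
   Development of the computer hardware course group and research on laboratory teaching  | 
  
      | 
  
   Microcomputer Principles and Interfacing   | 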
 
| 
      | 
  
      | 
  
      | 
  
   Programming Languages  | 
 
| 
   Class advisor:  | 
  
      | 
  
   Chinese Information Processing  | 
 |
| 
      | 
  
      | 
  
   Digital Circuit Design   | 
 |
| 
      | 
  
      | 
  
      | 
 |
| 
   Graduate student training:  | 
  |||
| 
   2020: Graduate student team won second place in the KDD Cup AutoML international competition  | 
  
   | 
 
| 
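   2019: Graduate student team won the national gold award at the 5th China "Internet+" College Students Innovation and Entrepreneurship Competition  | 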
  
   | 
 
| 
   2019: Graduate student team won first place in the NeurIPS AutoSpeech international competition
    | 
  
   | 
 
| 
   2019: Graduate student team won third place in the NeurIPS AutoDL international competition
    | 
  
   | 
 
| 
   2019: Graduate student team won a Top 10 winner award in the KDD Cup AutoML international competition  | 
  
   | 
 
| 
   2019: Graduate student team won first place in the ACML AutoSpeech international competition  | 
  
   | 
 
| 
   2019: Graduate student team won fourth place in the ACML AutoWSL international competition  | 
  
   | 
 
| 
   2019: Graduate student team won seventh place in the WAIC AutoNLP international competition  | 
  
   | 
 
| 
   2018: Graduate student team won third place in the NeurIPS AutoML international competition
    | 
  
   | 
 
| 
   2020: Graduate student team won third place in the 2018 PAKDD AutoML international competition  | 
  
   | 
 
| 
   2016: Graduate student team won the CloudSort championship in the SortBenchmark international sorting competition  | 
  |
| 
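   2016: Graduate student team won the Big Data Skills championship at the Ministry of Education's 2nd National University Cloud Computing Application Innovation Competition  | 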
  |
| 
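   2015: Graduate student team won the Big Data Skills championship at the Ministry of Education's 1st National University Cloud Computing Application Innovation Competition  | 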
  |
| 
   2012: Google Teaching Award  | 
  |
| 
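   2012: Graduate students from the course entered the 1st "China Cloud/Mobile Internet Innovation Award Competition" as teams and won 9 awards  | 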
  |
| 
   2000: Jiangsu Province Science and Technology Progress Award, Second Prize  | 
  |
| 
   1993: Jiangsu Province Science and Technology Progress Award, Second Prize  | 
  |
| 
   1997: Winner's award, 3rd China PC Application Software Design Competition  | 
  |
| 
   1997/1996/1995: Outstanding Young Teacher of Nanjing University  | 
  |
| 
   1995: Jiangsu Province Advanced Science and Technology Worker of the Eighth Five-Year Plan period  | 
  |
| 
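   1995: Second Prize for Teaching Materials, State Education Commission; First Prize for Outstanding Teaching Materials, Nanjing University  | 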
  |
| 
   1992: Jiangsu Province Outstanding Software Award, First Prize  | 
  |
| 
   1991: Nanjing University Special Contribution Award for Science and Technology Development  | 
  |
| 
   | 
  |
| 
     Hobbies
                        
      | 
  
  | 
 
| 
      | 
  
      | 
 
| 
      Table tennis, reading, philosophy, traditional Chinese culture, traditional Chinese medicine and health care  | 
  |
| 
      | 
  
   
  | 
 
| 
      
  书籍与发表论文                
                        
    Publications  | 
 
| 
  
   Book: 《深入理解大数据:大数据处理与编程实践》 (Understanding Big Data in Depth: Big Data Processing and Programming Practice), China Machine Press, 2014. A textbook in the system-capability training series for computer science majors of the State Education Commission's Computer Teaching Steering Committee.
   Research papers:
   1. Rong Gu, Han Yin, Weichang Zhong, Chunfeng Yuan, Yihua Huang. Meces: Latency-efficient Rescaling via Prioritized State Migration for Stateful Distributed Stream Processing Systems. Accepted by USENIX Annual Technical Conference (USENIX ATC 2022, CCF-A), to appear.
   2. Jingfan Chen, Wenqi Fan, Guanghui Zhu, Xiangyu Zhao, Chunfeng Yuan, Qing Li, and Yihua Huang. Knowledge-enhanced Black-box Attacks for Recommendations. Accepted by the 28th SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD 2022, CCF-A), to appear.
   3. Guanghui Zhu, Zhuoer Xu, Chunfeng Yuan, and Yihua Huang. DIFER: Differentiable Automated Feature Engineering. Accepted by the 1st International Conference on Automated Machine Learning (AutoML-Conf 2022), to appear.
   4. Rong Gu, Kai Zhang, Zhihao Xu, Yang Che, Bin Fan, Haojun Hou, Haipeng Dai, Li Yi, Yu Ding, Guihai Chen and Yihua Huang. Fluid: Dataset Abstraction and Elastic Acceleration for Cloud-native Deep Learning Training Jobs. Accepted by IEEE ICDE 2022 (CCF-A), to appear.
   5. Rong Gu, Yuquan Chen, Shuai Liu, Haipeng Dai, Guihai Chen, Kai Zhang, Yang Che, and Yihua Huang. Liquid: Intelligent Resource Estimation and Network-Efficient Scheduling for Deep Learning Jobs on Distributed GPU Clusters. Accepted by IEEE TPDS (CCF-A), to appear.
   6. Rong Gu, Jun Shi, Xiaofei Chen, Zhaokang Wang, Yang Che, Kai Zhang, Yihua Huang. Octopus-DF: Unified DataFrame-based Cross-platform Data Analytic System. Accepted by PARCO (CCF-B), to appear.
   7. Guanghui Zhu, Feng Cheng, Defu Lian, Chunfeng Yuan, and Yihua Huang. NAS-CTR: Efficient Neural Architecture Search for Click-Through Rate Prediction. Proc. of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022, CCF-A), accepted, 2022.
   8. Jingfan Chen, Guanghui Zhu, Haojun Hou, Chunfeng Yuan, and Yihua Huang. AutoGSR: Neural Architecture Search for Graph-based Session Recommendation. Proc. of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2022, CCF-A), accepted, 2022.
   9. Guanghui Zhu, Wenjie Wang, Zhuoer Xu, Feng Cheng, Mengchuan Qiu, Chunfeng Yuan, and Yihua Huang. PSP: Progressive Space Pruning for Efficient Graph Neural Architecture Search. Proc. of the IEEE 38th International Conference on Data Engineering (ICDE 2022, CCF-A), accepted, 2022.
   10. Chengcheng Mai, Mengchuan Qiu, Kaiwen Luo, Ziyan Peng, Jian Liu, Chunfeng Yuan, Yihua Huang. Pretraining Multi-modal Representations for Chinese NER Task with Cross-Modality Attention. Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining (WSDM, CCF-B), pp. 726-734, 2022.
   11. Rong Gu, Zhiqiang Zuo, Xi Jiang, Han Yin, Zhaokang Wang, Linzhang Wang, Xuandong Li, and Yihua Huang. Towards Efficient Large-scale Interprocedural Program Static Analysis on Distributed Data-Parallel Computation. IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS, CCF-A). Vol.32(4), 2021, pp. 867-883.
   12. Zhaokang Wang, Weiwei Hu, Guowang Chen, Chunfeng Yuan, Rong Gu, Yihua Huang. Towards Efficient Distributed Subgraph Enumeration via Backtracking-based Framework. IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS, CCF-A). Vol.32(12), 2021, pp. 2953-2969.
   13. Rong Gu, Yang Qi, Tongyu Wu, Zhaokang Wang, Xiaolong Xu, Chunfeng Yuan, Yihua Huang. SparkDQ: Efficient Generic Big Data Quality Management on Distributed Data-Parallel Computation. Journal of Parallel and Distributed Computing (JPDC, CCF-B). Vol.156(1), 2021, pp. 132-147.
   14. Rong Gu, Chongjie Li, Haipeng Dai, Yili Luo, Xiaolong Xu, Shaohua Wan, Yihua Huang. Improving In-Memory File System Reading Performance by Fine-Grained User-Space Cache Mechanisms. Journal of Systems Architecture (JSA, CCF-B). Vol.115(1), 2021, pp. 1-15.
   15. Zhaokang Wang, Shen Wang, Junhong Li, Chunfeng Yuan, Rong Gu and Yihua Huang. Distributed Local Structural Vertex Similarity Calculation on Big Graphs. Journal of Parallel and Distributed Computing (JPDC, CCF-B). Vol.158(1), 2021, pp. 29-46.
   16. Zhaokang
  Wang, Yunpan Wang, Chunfeng Yuan, Rong Gu, Yihua Huang. Empirical Analysis of Performance Bottlenecks in Graph Neural Network Training and Inference with GPUs. Neurocomputing (CCF-C). Vol.446(1), 2021, pp. 165-191.
   17. Zhuoer Xu, Guanghui Zhu, Chunfeng Yuan, and Yihua Huang. One-Stage Tree: End-to-End Tree Builder and Pruner. Machine Learning Journal (MLJ, CCF-B), pp. 1-27, 2021.
   18. Zhaokang Wang, Junhong Li, Yifan Qi, Guanghui Zhu, Chunfeng Yuan, and Yihua Huang. UniGPS: A Unified Programming Framework for Distributed Graph Processing. Proc. of the 27th International Conference on Parallel and Distributed Systems (ICPADS, CCF-C), accepted, 2021.
   19. Guanghui Zhu, Feng Cheng, Mengchuan Qiu, Zhuoer Xu, Wenjie Wang, Chunfeng Yuan, and Yihua Huang. Progressive AutoSpeech: An Efficient and General Framework for Automatic Speech Classification. Proc. of the 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD, CCF-C), pp. 168-180, India, 2021.
   20. Chengcheng Mai, Xueming Qiu, Kaiwen Luo, Min Chen, Bo Zhao, Yihua Huang. TSSE-DMM: Topic Modeling for Short Texts Based on Topic Subdivision and Semantic Enhancement. Proceedings of the 25th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD, CCF-C), pp. 640-651, India, 2021.
   21. 麦丞程、陈玉婷、仇学明、刘健、赵博、袁春风、黄宜华. 公共服务热线中基于地域自适应的突发事件实时检测方法. 《计算机学报》 (CCF-A), 2020, Vol. 43 (12): 2259-2275.
   22. Guanghui
  Zhu and Ruancheng Zhu. Accelerating Hyperparameter Optimization of Deep Neural Network via Progressive Multi-Fidelity Evaluation. Proc. of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD, CCF-C), pp. 752-763, Singapore, 2020.
   23. Zhaokang Wang, Rong Gu, Weiwei Hu, Chunfeng Yuan, Yihua Huang. BENU: Distributed Subgraph Enumeration With Backtracking-based Framework. Proc. of the IEEE International Conference on Data Engineering (ICDE 2019), pp. 136-147, 2019. DOI: 10.1109/ICDE.2019.00021.
   24. Guanghui Zhu, Xiaoqi Wu, Chunfeng Yuan, Yihua Huang. HyMJ: A Hybrid Structure-Aware Approach to Distributed Multi-Way Join Query. IEEE International Conference on Data Engineering (ICDE 2019), short paper.
   25. Guanghui Zhu, Qian Wang, Qiwei Tang, Rong Gu, Chunfeng Yuan and Yihua Huang. Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms. IEEE Transactions on Parallel and Distributed Systems (TPDS'2019). 30(2): 2663-2676 (2019). DOI: 10.1109/TPDS.2019.292501414.
   26. Rong Gu, Yufa Zhou, Zhaokang Wang, Chunfeng Yuan, and Yihua Huang. Penguin: Efficient Query-based Framework for Replaying Large Scale Historical Data. IEEE Transactions on Parallel and Distributed Systems (TPDS'18), 2018. DOI: 10.1109/TPDS.2018.2829759.
   27. Rong Gu, Yun Tang, Chen Tian, Hucheng Zhou, Guanru Li, Xudong Zheng, and Yihua Huang. Improving Execution Concurrency of Large-Scale Matrix Multiplication on Distributed Data-Parallel Platforms. IEEE Transactions on Parallel and Distributed Systems (TPDS'17). Vol.28(9), 2017, pp. 2539-2552.
   28. Guanghui Zhu, Qiu Hu, Rong Gu, Chunfeng Yuan, and Yihua Huang. ForestLayer: Efficient Training of Deep Forests on Distributed Task-Parallel Platforms. Journal of Parallel and Distributed Computing (JPDC'2019). Vol(132):113-126.
   29. Guanghui Zhu, Chen Guo, Le Lu, Zhi Huang, Chunfeng Yuan, Rong Gu* and Yihua Huang*. DGST: Efficient and Scalable Suffix Tree Construction on Distributed Data-Parallel Platforms. Parallel Computing (PC'2019), Vol(87):87-102.
   30. Rong Gu, Shiqing Fan, Qiu Hu, Chunfeng Yuan and Yihua Huang. Parallelizing Machine Learning Optimization Algorithms on Distributed Data-Parallel Platforms with Parameter Server. Proc. of the 24th International Conference on Parallel and Distributed Systems (ICPADS 2018), pp. 126-133, Sentosa, Singapore, Dec. 11-13, 2018.
   31. Rong Gu, Chongjie Li, Peng Shu, Chunfeng Yuan, Yihua Huang. Adaptive Cache Policy Scheduling for Big Data Applications on Distributed Tiered Storage System. Concurrency and Computation: Practice and Experience. 31(15):1-25 (2019). DOI: 10.1002/cpe.5138.
   32. Rong Gu, Min Chen, Wenjia Yang, Chunfeng Yuan and Yihua Huang. Seal: Efficient Training Large Scale Statistical Machine Translation Models on Spark. Proc. of the 24th International Conference on Parallel and Distributed Systems (IEEE ICPADS 2018), pp. 118-125, Sentosa, Singapore, Dec. 11-13, 2018.
   33. Rong Gu, Kaixuan Huang, Zhixiang Zhang, Chunfeng Yuan and Yihua Huang. Push-based Network-efficient Hadoop YARN Scheduling Mechanism for In-memory Computing. Proc. of the 25th International Conference on Parallel and Distributed Systems (IEEE ICPADS 2019), pp. 133-140, 2019. DOI: 10.1109/ICPADS.2019.00026.
   34. Guanghui Zhu, Xiaoqi Wu, Rong Gu, Chunfeng Yuan, Yihua Huang. AutoMJ: Towards Efficient Multi-way Join Query on Distributed Data-Parallel Platform. Proceedings of the 23rd International Conference on Parallel and Distributed Systems (ICPADS 2017), pp. 161-169, Shenzhen, China, Dec. 15-17, 2017.
   35. Wei Ge, Xianxian Li, Chunfeng Yuan, Yihua Huang. Correlation-aware Partitioning for Skewed Range Query Optimization. World Wide Web (WWW'18), 2018, pp. 1-27.
   36. Bo Zhao, Hucheng Zhou, Guoqiang Li, and Yihua Huang. ZenLDA: Large-Scale Topic Model Training on Distributed Data-Parallel Platform. Big Data Mining and Analytics, March 2018, 1(1): 57-74.
   37. Peng Shu, Rong Gu, Qianhao Dong, Chunfeng Yuan, Yihua Huang. Accelerating Big Data Applications on Tiered Storage System with Various Eviction Policies. Proc. of the IEEE International Symposium on Parallel and Distributed Processing with Applications (IEEE ISPA 2016), pp. 1350-1357, Tianjin, China, August 23-26, 2016.
   38. Bo Zhao, Yongji He, Chunfeng Yuan and Yihua Huang. Stock Market Prediction Exploiting Microblog Sentiment Analysis. International Joint Conference on Neural Networks (IJCNN 2016), pp. 4482-4488, Vancouver, Canada, July 24-29, 2016.
   39. Minmei Wang, Bo Zhao, Chunfeng Yuan, Yihua Huang. PTR: Phrase-Based Topical Ranking for Automatic Keyphrase Extraction in Scientific Publications. International Conference on Neural Information Processing (ICONIP 2016), Tokyo, Japan, Oct. 16-21, 2016.
   40. 顾荣, 仇红剑, 杨文家, 胡伟, 袁春风, 黄宜华. Goldfish:基于矩阵分解的大规模RDF数据存储与查询系统. 《计算机学报》, 2017, No. 10, pp. 2212-2230.
   41. 朱光辉, 黄圣彬, 袁春风, 黄宜华. SCoS:基于Spark的并行谱聚类算法设计与实现. 《计算机学报》, 2017, No. 6.
   42. Rong
  Gu, Shanyong Wang, FangFang Wang, Chunfeng Yuan, Yihua Huang. Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark. 2015 IEEE International Parallel & Distributed Processing Symposium (IPDPS 2015), India, May 25-29, 2015.
   43. Rong Gu, Xiaoliang Yang, Jinshuang Yan, Yuanhao Sun, Bing Wang, Chunfeng Yuan, and Yihua Huang. SHadoop: Improving MapReduce Performance By Optimizing Job Execution Mechanism in Hadoop Clusters. Journal of Parallel and Distributed Computing (JPDC'14). Vol.74(3), 2014, pp. 2166-2179.
   44. Rong Gu, Wei Hu, Yihua Huang. Rainbow: A Distributed and Hierarchical RDF Triple Store with Dynamic Scalability. Proc. of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), pp. 561-566, Washington, USA, Oct. 27-30, 2014.
   45. Shengsheng Shi, Chengfei Liu, Chunfeng Yuan, Yihua Huang. Multi-Feature and DAG-Based Multi-Tree Matching Algorithm for Automatic Web Data Mining. The 2014 Web Intelligence Congress (WI 2014), Warsaw, Poland, Aug. 11-14, 2014.
   46. Hongjian Qiu, Rong Gu, Chunfeng Yuan and Yihua Huang. YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark. The 3rd International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics (ParLearning 2014), in conjunction with IPDPS 2014, Phoenix, USA, May 23, 2014.
   47. Lei Jin, Rong Gu, Chunfeng Yuan and Yihua Huang. Large Scale Deep Learning On Xeon Phi Many-core Coprocessor. The 3rd International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics (ParLearning 2014), in conjunction with IPDPS 2014, Phoenix, USA, May 23, 2014.
   48. Ge, Wei; Huang, Yihua; Zhao, Di; Luo, Shengmei; Yuan, Chunfeng; Zhou, Wenhui; Tang, Yun; Zhou, Juan. CinHBa: A Secondary Index with Hotscore Caching Policy on Key-value Data Store. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v 8933, pp. 602-615, 2014.
   49. 顾荣, 王芳芳, 袁春风, 黄宜华. YARM:基于MapReduce的高效可扩展的语义推理引擎. 《计算机学报》, No. 01, pp. 74-85, 2014/8.
   50. 顾荣, 严金双, 杨晓亮, 袁春风, 黄宜华. Hadoop MapReduce短作业执行性能优化. 《计算机研究与发展》, 2014, Vol. 51 (6): 1270-1280.
   51. 赵博, 黄书剑, 戴新宇, 袁春风, 黄宜华. 基于分布内存数据库的并行化层次短语机器翻译算法. 《计算机研究与发展》, 2014, Vol. 51 (12): 2724-2732.
   52. Rong
  Gu, Furao Shen, and Yihua Huang. A Parallel Computing Platform for Training Large Scale Neural Networks. Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2013), pp. 376-384, Santa Clara, CA, USA, Oct. 6-9, 2013.
   53. Shengsheng Shi, Wu Wei, Yulong Liu, Haitao Wang, Lei Luo, Chunfeng Yuan, and Yihua Huang. NEXIR: A Novel Web Extraction Rule Language toward a Three-Stage Web Data Extraction Model. The 14th International Conference on Web Information System Engineering (WISE 2013), Nanjing, China, Oct. 13-15, 2013. WISE 2013, Part I, “Lecture Notes in Computer Science” Proceedings 8180, pp. 29-42, Springer-Verlag Berlin Heidelberg, 2013.
   54. Wu Wei, Shengsheng Shi, Yulong Liu, Haitao Wang, Chunfeng Yuan and Yihua Huang. Extraction Rule Language for Web Information Extraction and Integration. The 10th Web Information System and Application Conference (WISA 2013), pp. 65-70, Yangzhou, China, Nov. 1-3, 2013.
   55. Shengsheng Shi, Fuliang Quan, Tao Xie, Chunfeng Yuan and Yihua Huang. Layered and Weighted Tree Matching Algorithm for Automatic Web Data Records Recognition. The 10th Web Information System and Application Conference (WISA 2013), pp. 55-60, Yangzhou, China, Nov. 1-3, 2013.
   56. Yi Shen, Shengsheng Shi, Haitao Wang, Wu Wei, Chunfeng Yuan, and Yihua Huang. Parallel Approach and Platform for Large-scale Web Data Extraction. The First International Conference on Advanced Cloud and Big Data (CBD 2013), Nanjing, Dec. 13-15, 2013.
   57. Wenhui Zhou, Chunfeng Yuan, Rong Gu, Yihua Huang. Large Scale Nearest Neighbors Search Based on Neighborhood Graph. The First International Conference on Advanced Cloud and Big Data (CBD 2013), Nanjing, Dec. 13-15, 2013.
   58. Jinshuang Yan, Xiaoliang Yang, Rong Gu, Chunfeng Yuan, and Yihua Huang. Performance Optimization for Short MapReduce Job Execution in Hadoop. Proceedings of the 2nd International Conference on Cloud and Green Computing and 2nd International Conference on Social Computing and Its Applications (CGC/SCA 2012), pp. 688-694, 2012.
   59. Tao Xie, Shengsheng Shi, Fuliang Quan, Chunfeng Yuan, and Yihua Huang. Research on Complex Structure-Oriented Accurate Web Information Extraction Rules. Proceedings of the 2010 IEEE International Conference on Progress in Informatics and Computing (PIC 2010), pp. 312-316, 2010.
   60. Xiaoliang Yang, Chunfeng Yuan, Yihua Huang. Parallelization of BLAST with MapReduce for Long Sequence Alignment. Proceedings of the 4th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP 2011), pp. 241-246, 2011.
   61. Tao Xiao, Shuai Wang, Chunfeng Yuan, Yihua Huang. PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets. The 4th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP 2011), pp. 252-257, 2011.
   62.
  Yongzhuang Wei, Shuai Wang, Chunfeng Yuan, and Yihua Huang. Parallelized Near-Duplicate Document Detection Algorithm for Large Scale Chinese Web Pages. Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2012), pp. 523-529, 2012.
   63. Jian Zhang, Chunfeng Yuan, and Yihua Huang. Parallelized Similarity Flooding Algorithm for Processing Large Scale Graph Datasets with MapReduce. Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2012), pp. 184-188, 2012.
   64. Yulong Liu, Shengsheng Shi, Chunfeng Yuan and Yihua Huang. Automated Text Data Extraction Based on Unsupervised Small Sample Learning. The 7th Intelligent System and Knowledge Engineering (ISKE 2012), Beijing, Dec. 15-17, 2012. Chapter in the book “Foundations and Applications of Intelligent Systems”, Advances in Intelligent Systems and Computing 213, pp. 133-150, Springer-Verlag Berlin Heidelberg, 2013.
   65. Chunfeng Yuan, Yihua Huang, Zhesheng Zhang, Guihai Chen, Wanchun Dou. Improvements on Teaching Methods and Contents for the “Computer Organization and Architecture” Curriculum. Proceedings of the International Conference on Scalable Computing and Communications - The 8th International Conference on Embedded Computing, ScalCom-EmbeddedCom 2009, pp. 560-565, 2009.
   66. Jin Yu, Jianxin Yu, Yihua Huang. Design and Implementation of Embedded Networked Intelligent Chinese Checkers Game Software. International Conference on Automatic Control and Artificial Intelligence (ACAI 2012), Xiamen, China, March 24-26, 2012.
   67. Yihua Huang, Tianyun Ni, Lei Zhou and Stanley Su. JXP4BIGI: A Generalized, Java XML-based Approach for Biological Information Gathering and Integration. Bioinformatics, Vol. 19, No. 18, 2003.
   68. Stanley Su, Chunbo Huang, Joachim Hammer, Yihua Huang, Haifei Li, Liu Wang, Youzhong Liu, Charnyote P., Minsso Lee, Herman Lam. An Internet-Based Negotiation Server for E-Commerce. The VLDB (Very Large Data Bases) Journal, Special Issue on E-Services. Vol. 10, 2001.  |