• Accelerating Sparse Matrix Vector Multiplication on Many-core GPUs

  • by Weizhi Xu, Zhiyong Liu, Dongrui Fan, Shuai Jiao, Xiaochun Ye, Fenglong Song, and Chenggang Yan
  • International Conference on Computer and Information Technology,   2012
  • Godson-T: An Efficient Many-core Processor Exploring Thread-level Parallelism

  • by Dongrui Fan, Hao Zhang, Da Wang, Xiaochun Ye, Fenglong Song, Guojie Li, Ninghui Sun
  • IEEE Computer Society,   2012,   Micro 
  • High-Efficient Architecture of Godson-T Many-Core Processor

  • by Dongrui Fan, Hao Zhang, Da Wang, Xiaochun Ye, Fenglong Song, Junchao Zhang, and Lingjun Fan
  • The 23rd Hot Chips Conference,   2011-08,   Stanford University, CA, Session 1.3
  • Scalability Study of Molecular Dynamics Simulation on Godson-T Many-core Architecture

  • by Liu Peng, Guangming Tan, Rajiv K. Kalia, Aiichiro Nakano, Priya Vashishta, Dongrui Fan, Hao Zhang, Fenglong Song
  • 2011-06-01,   MRSC 2011 
  • Optimizing the Barnes-Hut Algorithm in UPC

  • by Junchao Zhang, Babak Behzad, Marc Snir
  • International Conference for High Performance Computing, Networking, Storage and Analysis (SC),   2011,   Seattle, WA 
  • Extendable Pattern-Oriented Directives

  • by Huimin Cui, Jingling Xue, Lei Wang, Xiaobing Feng, Yang Yang, and Dongrui Fan
  • 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization,   2011,   pp.107-118 
  • A Case Study: Low Power Design-for-Testability Features of a Multi-core Processor Godson-T

  • by Da Wang, Dongrui Fan, Yu Hu
  • In Proc. of International Conference of Advanced Measurement and Test,   2011,   vol.302, pp.1237-1242
  • Optimizing Web Browser on Many-Core Architectures

  • by Lingjun Fan, Weisong Shi, Shibin Tang, Chenggang Yan, Dongrui Fan
  • The 12th International Conference on Parallel and Distributed Computing, Applications and Technologies,   2011,   Seoul, Korea, pp. 173-178
  • New Methodologies for Parallel Architecture

  • by Dongrui Fan, Xiaowei Li, Guojie Li
  • 2011,   JCST, 26(4): 578-587 
  • High Performance Comparison-Based Sorting Algorithm on Many-Core GPUs

  • by Xiaochun Ye, Dongrui Fan, Wei Lin, Nan Yuan, Paolo Ienne
  • IPDPS 2010,   2010.4,   pp.1-10 
  • Minimal Multi-Threading: Finding and Removing Redundant Instructions in Multi-Threaded Processors

  • by Guoping Long, Diana Franklin, Susmit Biswas, Pablo Ortiz, Jason Oberg, Dongrui Fan, Frederic T. Chong
  • Micro 43,   2010.12,   pp.337-348 
  • GVE: Godson-T Verification Engine for manycore architecture Rapid prototyping and debugging

  • by ZhengMeng Lei, Lunkai Zhang, Fenglong Song, Shibin Tang, Dongrui Fan
  • FPT'10,   2010.12,   pp.253-256 
  • Efficient Address Mapping of Shared Cache for On-Chip Many-Core Architecture

  • by Fenglong Song, Dongrui Fan, Zhiyong Liu, Junchao Zhang, Lei Yu
  • In Proceedings of Euro-Par 2010 Conference,   2010,   pp. 280-291
  • Thread Owned Block Cache: Managing Latency in Many-Core Architecture

  • by Fenglong Song, Zhiyong Liu, Dongrui Fan, Hao Zhang, Lei Yu, Shibin Tang
  • Euro-Par 2010,   2010,   pp. 292-303 
  • P-GAS: Parallelizing a Cycle-Accurate Event-Driven Many-Core Processor Simulator Using Parallel Discrete Event Simulation

  • by Huiwei Lv, Yuan Cheng, Lu Bai, Mingyu Chen, Dongrui Fan, and Ninghui Sun
  • PADS2010,   2010,   pp.1-8 
  • Landing Stencil Code on Godson-T

  • by Huimin Cui, Lei Wang, Dongrui Fan, Xiaobing Feng
  • J. Comput. Sci. Technol,   2010,   pp.886-894 
  • A Synchronization-Based Alternative to Directory Protocol

  • by He Huang, Lei Liu, Nan Yuan, Wei Lin, Fenglong Song, Junchao Zhang, Dongrui Fan
  • ISPA-09,   2009.8,   pp.175-181 
  • Evaluation Method of Synchronization for Shared-Memory On-Chip Many-Core Processor

  • by Fenglong Song, Zhiyong Liu, Dongrui Fan, He Huang, Nan Yuan, Lei Yu, Junchao Zhang
  • ISPA-09,   2009.8,   pp.571-576 
  • Data Management: The Spirit to Pursuit Peak Performance on Many-Core Processor

  • by Yongbin Zhou, Junchao Zhang, Shuai Zhang, Nan Yuan, Dongrui Fan
  • ISPA-09,   2009.8,   pp.559-564 
  • Study on Fine-grained Synchronization in Many-Core Architecture

  • by Lei Yu, Zhiyong Liu, Dongrui Fan, Fenglong Song, Junchao Zhang, Nan Yuan
  • SNPD 2009,   2009.5,   pp.524-529 
  • Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions

  • by Dongrui Fan, Nan Yuan, Junchao Zhang, Yongbin Zhou, Wei Lin, Fenglong Song, Xiaochun Ye, He Huang, Lei Yu, Guoping Long, Hao Zhang, Lei Liu
  • Journal of Computer Science and Technology,   2009.11,   24(6):1061-1073 
  • A Fast Linear-Space Sequence Alignment Algorithm with Dynamic Parallelization Framework

  • by Xiaochun Ye, Dongrui Fan, Wei Lin
  • CIT2009,   2009.10,   pp.274-279 
  • A Low-Complexity Synchronization Based Cache Coherence Solution for Many Cores

  • by Wei Lin, Nan Yuan, Dongrui Fan, He Huang
  • CIT2009,   2009.10,   pp.69-75 
  • Software and Hardware Cooperate for 1-D FFT Algorithm Optimization on Multicore Processors

  • by Yongbin Zhou, Junchao Zhang, Dongrui Fan
  • CIT2009,   2009.10,   pp.86-91 
  • Architectural Support for Cilk Computations on Many-core Architectures

  • by Guoping Long, Dongrui Fan, Junchao Zhang
  • PPoPP Poster,   2009,   pp.285-286 
  • Soft Coherence: Preliminary Experiments with Error-Tolerant Cache Coherence in Numerical Applications

  • by Guoping Long, Dongrui Fan, Frederic T. Chong
  • CMP-MSI,   2009,   pp.15-43
  • Characterizing and Understanding the Bandwidth Behavior of Workloads on Multi-core

  • by Guoping Long, Dongrui Fan, Junchao Zhang
  • Euro-Par,   2009,   pp.110-121 
  • High Performance Matrix Multiplication on Many Cores

  • by Nan Yuan, Yongbin Zhou, Guangming Tan, Junchao Zhang, Dongrui Fan
  • In proceedings of the 15th international Euro-Par conference on Parallel Processing,   2009,   pp.948-959 
  • Broadcast in Extended PEM Model

  • by Ergude Bao, Dong-Rui Fan, Xiao-Yu Ma, Nan Yuan, Wei-Sheng Li, Yang Yang
  • HPCS09,   2009
  • An Efficient and Flexible Task Management for Many Cores

  • by Nan Yuan, Lei Yu, Dongrui Fan
  • LNCS Transactions on High-Performance Embedded Architectures and Compilers,   2009,   pp.285-286 
  • Design of New Hash Mapping Functions

  • by Fenglong Song, Zhiyong Liu, Dongrui Fan, Junchao Zhang, Lei Yu, Nan Yuan, Wei Lin
  • CIT2009,   2009,   pp.45-50
  • Location Consistency Model Revisited -Problem, Solution and Prospects

  • by Guoping Long, Nan Yuan, Dongrui Fan
  • 2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies,   2008,   pp.91-98
  • Efficient Parallelization of a Protein Sequence Comparison Algorithm on Manycore Architecture

  • by Xiaochun Ye, Van Hoa Nguyen, Dominique Lavenier, Dongrui Fan
  • Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies,   2008,   pp.167-170 
  • A Quantitative Study of the On-Chip Network and Memory Hierarchy Design for Many-Core Processor

  • by Xu Wang, Ge Gan, Joseph Manzano, Dongrui Fan, Shuxu Guo
  • 14th IEEE International Conference on Parallel and Distributed Systems,   2008,   pp.689-696
  • A Performance Model of Dense Matrix Operations on Many-Core Architectures

  • by Guoping Long, Dongrui Fan, Junchao Zhang, Fenglong Song, Nan Yuan, and Wei Lin
  • Euro-Par,   2008,   pp.120-129 
  • Experience on Optimizing Irregular Computation for Memory Hierarchy in Manycore Architecture

  • by Guangming Tan, Nan Yuan, Dongrui Fan, Andrew Russo, Guang R. Gao
  • PPoPP'08,   2008,   pp.279-280