Papers for the course:

These papers are a stratified sample of work in GPU algorithms. The space of parallel algorithms covered in based on the so-called “Berkeley dwarfs” [1] , an attempt to describe what are the key algorithmic components from which most computationally-intensive applications can be built. The list is biased toward more recent work, although some early GPU papers—now almost a decade old—may also be found.

[1] Asanović, et al., “The Landscape of Parallel Computing Research: A View from Berkeley”, EECS Department, University of California, Berkeley, Dec. 2006, http://www2.eecs.berkeley.edu/Pubs/TechRpts/2006/EECS-2006-183.html.

List of Dwarfs

  1. Dense Linear Algebra
  2. Sparse Linear Algebra
  3. Spectral Methods
  4. N-Body Methods
  5. Structured Grids
  6. Unstructured Grids
  7. MapReduce
  8. Combinational Logic
  9. Graph Traversal
  10. Dynamic Programming
  11. Backtrack and Branch-and-Bound
  12. Graphical Models
  13. Finite State Machines

Papers for each Dwarf

  1. Dense Linear Algebra
  2. Sparse Linear Algebra
  3. Spectral Methods
  4. N-Body
    • Scaling Hierarchical N-body Simulations on GPU Clusters
      • Author(s): P. Jetley, L. Wesolowski, F. Gioachin, L. V. Kalé and T. R. Quinn
      • 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, New Orleans, LA, 2010
      • Presenter: Connor Scully-Allison
      • Paper d
  5. Structured Grids
    • For solving partial differential equations (PDEs). If you are interested in PDE solving, consult with Lee Barford to choose one or two papers to present.
  6. Unstructured Grids
    • For solving partial differential equations (PDEs). If you are interested in PDE solving, consult with Lee Barford to choose one or two papers to present
  7. MapReuduce
    • Multi-GPU MapReduce on GPU Clusters
      • Author(s): J. A. Stuart and J. D. Owens
      • 2011 IEEE International Parallel & Distributed Processing Symposium, Anchorage, AK, 2011
      • Presenter: Om Mishra
      • Paper a
  8. Combinatorial Logic
    • (don't use a GPU: use an FPGA)
  9. Graph traversal
  10. Dynamic Programming
  11. Backtrack and branch-and-bound
  12. Graphical Models
  13. Finite State Machine

Presentation Order

  • a - Om Mishra - Dwarf 7-1
  • b - Daniel Lopez - Dwarf 12-1
  • c - Blanca Miller - Dwarf 11-1
  • d - Connor Scully-Allison - Dwarf 4-1
  • e - Janelle Bankenburg - Dwarf 10-2
  • f - Steven Fisher - Dwarf 9-1
  • g - Syed Zawad - Dwarf 10-1
  • h - Hannah Munoz - Dwarf 9-3
  • i - Chad Adams - Dwarf 12-3
  • j - Eric Li - Dwarf 13-1
  • k - Aavaas Gajurel - Dwarf 12-2

/OL>