--- ---

Refereed Publications

Andrew Davidson, Sean Baxter, Michael Garland, and John D. Owens. Work-Efficient Parallel GPU Methods for Single Source Shortest Paths. In Proceedings of the 28th IEEE International Parallel and Distributed Processing Symposium, May 2014. [ bib | http ]

Jason Mak, Mauricio Hess-Flores, Shawn Recker, John D. Owens, and Kenneth I. Joy. GPU-Accelerated and Efficient Multi-View Triangulation for Scene Reconstruction. In Proceeedings of the IEEE Winter Conference on Applications of Computer Vision, March 2014. [ bib | http ]

Mohamed Ebeida, Scott Mitchell, Anjul Patney, Andrew Davidson, Stanley Tzeng, Muhammad Awad, Ahmed Mahmoud, and John D. Owens. Exercises in High-Dimensional Sampling: Maximal Poisson-disk Sampling and k-d Darts. In Janine Bennett, Fabien Vivodtzev, and Valerio Pascucci, editors, Topological and Statistical Methods for Complex Data – Tackling Large-Scale, High-Dimensional, and Multivariate Data Sets. Springer, 2014. [ bib ]

Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Keith R. Dalbey, Andrew A. Davidson, and John D. Owens. k-d Darts: Sampling by k-Dimensional Flat Searches. ACM Transactions on Graphics, 33:3:1–3:16, January 2014. [ bib | DOI ]

Mohamed S. Ebeida, Ahmed H. Mahmoud, Muhammad A. Awad, Mohammed A. Mohammed, Scott A. Mitchell, Alex Rand, and John D. Owens. Sifted Disks. Computer Graphics Forum, 32(2):509–518, May 2013. [ bib | DOI | .pdf ]

Stanley Tzeng, Brandon Lloyd, and John D. Owens. A GPU Task-Parallel Model with Dependency Resolution. IEEE Computer, 45(8):34–41, August 2012. [ bib | DOI | http ]

Shengren Li, Lance Simons, Jagadeesh Bhaskar Pakaravoor, Fatemeh Abbasinejad, John D. Owens, and Nina Amenta. kANN on the GPU with Shifted Sorting. In Proceedings of High Performance Graphics 2012, pages 39–47, June 2012. [ bib | DOI | http ]

Stanley Tzeng, Anjul Patney, Andrew Davidson, Mohamed S. Ebeida, Scott A. Mitchell, and John D. Owens. High-Quality Parallel Depth-of-Field Using Line Samples. In Proceedings of High Performance Graphics 2012, pages 23–31, June 2012. [ bib | DOI | http ]

Andrew Davidson, David Tarjan, Michael Garland, and John D. Owens. Efficient Parallel Merge Sort for Fixed and Variable Length Keys. In Proceedings of Innovative Parallel Computing, InPar '12, May 2012. [ bib | DOI | http ]

Mohamed S. Ebeida, Scott A. Mitchell, Anjul Patney, Andrew A. Davidson, and John D. Owens. A Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions. Computer Graphics Forum, 31(2):785–794, May 2012. [ bib | DOI | http ]

Kshitij Gupta, Jeff Stuart, and John D. Owens. A Study of Persistent Threads Style GPU Programming for GPGPU Workloads. In Proceedings of Innovative Parallel Computing, InPar '12, May 2012. [ bib | DOI | http ]

Ritesh A. Patel, Yao Zhang, Jason Mak, and John D. Owens. Parallel Lossless Data Compression on the GPU. In Proceedings of Innovative Parallel Computing, InPar '12, May 2012. [ bib | DOI | http ]

Andrew Davidson and John Owens. Toward Techniques for Auto-tuning GPU Algorithms. In Kristján Jónasson, editor, Applied Parallel and Scientific Computing, volume 7134 of Lecture Notes in Computer Science, pages 110–119. Springer Berlin / Heidelberg, February 2012. [ bib | DOI ]

Yao Zhang, John Ludd Recker, Robert Ulichney, Ingeborg Tastl, and John D. Owens. Plane-dependent Error Diffusion on a GPU. In Proceedings of SPIE: IS&T/SPIE Electronic Imaging 2012 / Parallel Processing for Imaging Applications II, volume 8295B, pages 8295B–59:1–10, January 2012. [ bib | DOI | http ]

Mohamed S. Ebeida, Anjul Patney, John D. Owens, and Eric Mestreau. Isotropic conforming refinement of quadrilateral and hexahedral meshes using two-refinement templates. International Journal for Numerical Methods in Engineering, 88(10):974–985, 9 December 2011. [ bib | DOI | http ]

Kshitij Gupta and John D. Owens. Compute & Memory Optimizations for High-Quality Speech Recognition on Low-End GPU Processors. In Proceedings of the 2011 International Conference on High Performance Computing (HiPC), December 2011. [ bib | DOI | http ]

Dan A. Alcantara, Vasily Volkov, Shubhabrata Sengupta, Michael Mitzenmacher, John D. Owens, and Nina Amenta. Building an Efficient Hash Table on the GPU. In Wen-mei W. Hwu, editor, GPU Computing Gems, volume 2, chapter 4, pages 39–53. Morgan Kaufmann, October 2011. [ bib | DOI ]

Mohamed S. Ebeida, Scott A. Mitchell, Andrew A. Davidson, Anjul Patney, Patrick M. Knupp, and John D. Owens. Efficient and Good Delaunay Meshes From Random Points. In Proceedings of the SIAM Conference on Geometric and Physical Modeling (GD/SPM11), pages 1506–1515, October 2011. [ bib | DOI | http ]

Mark Silberstein, Assaf Schuster, and John D. Owens. Applying Software-Managed Caching and CPU/GPU Task Scheduling for Accelerating Dynamic Workloads. In Wen-mei W. Hwu, editor, GPU Computing Gems, volume 2, chapter 36, pages 501–517. Morgan Kaufmann, October 2011. [ bib | DOI ]

Jeff A. Stuart, Pavan Balaji, and John D. Owens. Extending MPI to Accelerators. In ASBD 2011: First Workshop on Architectures and Systems for Big Data, pages 19–23, October 2011. [ bib | DOI | ACM DL | http ]

Yao Zhang, Jonathan Cohen, Andrew A. Davidson, and John D. Owens. A Hybrid Method for Solving Tridiagonal Systems on the GPU. In Wen-mei W. Hwu, editor, GPU Computing Gems, volume 2, chapter 11, pages 117–132. Morgan Kaufmann, October 2011. [ bib | DOI | http ]

John Jenkins, Isha Arkatkar, John D. Owens, Alok Choudhary, and Nagiza F. Samatova. Lessons Learned from Exploring the Backtracking Paradigm on the GPU. In Euro-Par 2011: Proceedings of the 17th International European Conference on Parallel and Distributed Computing, volume 6853 of Lecture Notes in Computer Science, pages 425–437. Springer, August/September 2011. [ bib | DOI | http ]

Everett H. Phillips, Yao Zhang, Roger L. Davis, and John D. Owens. Acceleration of 2-D Compressible Flow Solvers with Graphics Processing Unit Clusters. Journal of Aerospace Computing, Information, and Communication, 8(8):237–249, August 2011. [ bib | DOI | http ]

Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Andrew Davidson, Patrick M. Knupp, and John D. Owens. Efficient Maximal Poisson-Disk Sampling. ACM Transactions on Graphics, 30(4):49:1–49:12, July 2011. [ bib | DOI | ACM DL | http ]

Jeff A. Stuart, Michael Cox, and John D. Owens. GPU-to-CPU Callbacks. In Euro-Par 2010 Workshops: Proceedings of the Third Workshop on UnConventional High Performance Computing (UCHPC 2010), volume 6586 of Lecture Notes in Computer Science, pages 365–372. Springer, July 2011. [ bib | DOI | http ]

Vladimir Glavtchev, Pınar Muyan-Özçelik, Jeffery M. Ota, and John D. Owens. Feature-Based Speed Limit Sign Detection Using a Graphics Processing Unit. In Proceedings of the 2011 IEEE Intelligent Vehicles Symposium, pages 195–200, June 2011. [ bib | DOI | http ]

Christopher P. Stone, Earl P. N. Duque, Yao Zhang, David Car, John D. Owens, and Roger L. Davis. GPGPU parallel algorithms for structured-grid CFD codes. In Proceedings of the 20th AIAA Computational Fluid Dynamics Conference, number 2011-3221, June 2011. [ bib | http ]

Andrew Davidson, Yao Zhang, and John D. Owens. An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU. In Proceedings of the 25th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2011, pages 956–965, May 2011. [ bib | DOI | http ]

Jeff A. Stuart and John D. Owens. Multi-GPU MapReduce on GPU Clusters. In Proceedings of the 25th IEEE International Parallel and Distributed Processing Symposium, pages 1068–1079, May 2011. [ bib | DOI | http ]

Andrew Davidson and John D. Owens. Register Packing for Cyclic Reduction: A Case Study. In Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, pages 4:1–4:6, March 2011. [ bib | DOI | ACM DL | http ]

Pınar Muyan-Özçelik, Vladimir Glavtchev, Jeffrey M. Ota, and John D. Owens. Real-Time Speed-Limit-Sign Recognition on an Embedded System Using a GPU. In Wen-mei W. Hwu, editor, GPU Computing Gems, volume 1, chapter 32, pages 497–516. Morgan Kaufmann, February 2011. [ bib | DOI | http ]

Yao Zhang and John D. Owens. A Quantitative Performance Analysis Model for GPU Architectures. In Proceedings of the 17th IEEE International Symposium on High-Performance Computer Architecture (HPCA 17), pages 382–393, February 2011. [ bib | DOI | http ]

Shubhabrata Sengupta, Mark Harris, Michael Garland, and John D. Owens. Efficient Parallel Scan Algorithms for many-core GPUs. In Jakub Kurzak, David A. Bader, and Jack Dongarra, editors, Scientific Computing with Multicore and Accelerators, Chapman & Hall/CRC Computational Science, chapter 19, pages 413–442. Taylor & Francis, January 2011. [ bib | DOI | http ]

Yao Zhang, John Ludd Recker, Robert Ulichney, Giordano B. Beretta, Ingeborg Tastl, I-Jong Lin, and John D. Owens. A Parallel Error Diffusion Implementation on a GPU. In Proceedings of SPIE: IS&T/SPIE Electronic Imaging 2011 / Parallel Processing for Imaging Applications, volume 7872, pages 78720K:1–9, January 2011. [ bib | DOI | http ]

Pınar Muyan-Özçelik, Vladimir Glavtchev, Jeffery M. Ota, and John D. Owens. A Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System using GPU Computing. In Michael Goesele, Stefan Roth, Arjan Kuijper, Bernt Schiele, and Konrad Schindler, editors, DAGM 2010: Proceedings of the 32nd Annual Symposium of the German Association for Pattern Recognition, volume 6376 of Lecture Notes in Computer Science, pages 162–171. Springer, September 2010. [ bib | DOI | http ]

Andrew Davidson and John D. Owens. Toward Techniques for Auto-Tuning GPU Algorithms. In Para 2010: State of the Art in Scientific and Parallel Computing, June 2010. [ bib | http ]

Anjul Patney, Stanley Tzeng, and John D. Owens. Fragment-Parallel Composite and Filter. Computer Graphics Forum (Proceedings of the Eurographics Symposium on Rendering), 29(4):1251–1258, June 2010. [ bib | DOI | http ]

Everett H. Phillips, Roger L. Davis, and John D. Owens. Unsteady Turbulent Simulations on a Cluster of Graphics Processors. In Proceedings of the 40th AIAA Fluid Dynamics Conference, number AIAA 2010-5036, June 2010. [ bib | http ]

Jeff A. Stuart, Cheng-Kai Chen, Kwan-Liu Ma, and John D. Owens. Multi-GPU Volume Rendering using MapReduce. In HPDC '10: Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing / MAPREDUCE '10: The First International Workshop on MapReduce and its Applications, pages 841–848, June 2010. [ bib | DOI | ACM DL | http ]

Stanley Tzeng, Anjul Patney, and John D. Owens. Task Management for Irregular-Parallel Workloads on the GPU. In Proceedings of High Performance Graphics 2010, pages 29–37, June 2010. [ bib | DOI | http ]

Yao Zhang, Jonathan Cohen, and John D. Owens. Fast Tridiagonal Solvers on the GPU. In Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2010), pages 127–136, January 2010. [ bib | DOI | ACM DL | http ]

Dan A. Alcantara, Andrei Sharf, Fatemeh Abbasinejad, Shubhabrata Sengupta, Michael Mitzenmacher, John D. Owens, and Nina Amenta. Real-Time Parallel Hashing on the GPU. ACM Transactions on Graphics, 28(5):154:1–154:9, December 2009. [ bib | DOI | ACM DL | http ]

Kshitij Gupta and John D. Owens. Three-Layer Optimizations for Fast GMM Computations on GPU-like Parallel Processors. In Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, pages 146–151, December 2009. [ bib | DOI | http ]

Anjul Patney, Mohamed S. Ebeida, and John D. Owens. Parallel View-Dependent Tessellation of Catmull-Clark Subdivision Surfaces. In Proceedings of High Performance Graphics 2009, pages 99–108, August 2009. [ bib | DOI | ACM DL | http ]

Luke J. Gosink, Kesheng Wu, E. Wes Bethel, John D. Owens, and Kenneth I. Joy. Data Parallel Bin-Based Indexing for Answering Queries on Multi-Core Architectures. In Proceedings of the 21st International Conference on Scientific and Statistical Database Management, volume 5566 of Lecture Notes in Computer Science, pages 110–129. Springer, June 2009. [ bib | DOI | http ]

Jeff A. Stuart and John D. Owens. Message Passing on Data-Parallel Architectures. In Proceedings of the 23rd IEEE International Parallel and Distributed Processing Symposium, May 2009. [ bib | DOI | http ]

Brian Budge, Tony Bernardin, Jeff A. Stuart, Shubhabrata Sengupta, Kenneth I. Joy, and John D. Owens. Out-of-core Data Management for Path Tracing on Hybrid Resources. Computer Graphics Forum (Proceedings of Eurographics 2009), 28(2):385–396, April 2009. [ bib | DOI | http ]

Everett H. Phillips, Yao Zhang, Roger L. Davis, and John D. Owens. Rapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units. In Proceedings of the 47th AIAA Aerospace Sciences Meeting, number AIAA 2009-565, January 2009. [ bib | http ]

Anjul Patney and John D. Owens. Real-Time Reyes-Style Adaptive Surface Subdivision. ACM Transactions on Graphics, 27(5):143:1–143:8, December 2008. [ bib | DOI | ACM DL | http ]

Sanjiv S. Samant, Junyi Xia, Pınar Muyan-Özçelik, and John D. Owens. High performance computing for deformable image registration: Towards a new paradigm in adaptive radiotherapy. Medical Physics, 35(8):3546–3553, August 2008. [ bib | DOI ]

Pınar Muyan-Özçelik, John D. Owens, Junyi Xia, and Sanjiv S. Samant. Fast Deformable Registration on the GPU: A CUDA Implementation of Demons. In Proceedings of the 2008 International Conference on Computational Science and Its Applications (First Technical Session on UnConventional High Performance Computing [UCHPC '08]), pages 223–233, July 2008. [ bib | DOI | http ]

Mark Silberstein, Assaf Schuster, Dan Geiger, Anjul Patney, and John D. Owens. Efficient Computation of Sum-products on GPUs Through Software-Managed Cache. In Proceedings of the 22nd ACM International Conference on Supercomputing, pages 309–318, June 2008. [ bib | DOI | ACM DL | http ]

John D. Owens, Mike Houston, David Luebke, Simon Green, John E. Stone, and James C. Phillips. GPU Computing. Proceedings of the IEEE, 96(5):879–899, May 2008. [ bib | DOI | http ]

Adam Moerschell and John D. Owens. Distributed Texture Memory in a Multi-GPU Environment. Computer Graphics Forum, 27(1):130–151, March 2008. [ bib | DOI | http ]

Aaron E. Lefohn, Shubhabrata Sengupta, and John D. Owens. Resolution-Matched Shadow Maps. ACM Transactions on Graphics, 26(4):20:1–20:17, October 2007. [ bib | DOI | ACM DL | http ]

John D. Owens, William J. Dally, Ron Ho, D. N. Jayasimha, Stephen W. Keckler, and Li-Shiuan Peh. Research Challenges for On-Chip Interconnection Networks. IEEE Micro, 27:96–108, September/October 2007. [ bib | DOI | .html ]

Mark Harris, Shubhabrata Sengupta, and John D. Owens. Parallel Prefix Sum (Scan) with CUDA. In Hubert Nguyen, editor, GPU Gems 3, chapter 39, pages 851–876. Addison Wesley, August 2007. [ bib | http ]

Shubhabrata Sengupta, Mark Harris, Yao Zhang, and John D. Owens. Scan Primitives for GPU Computing. In Graphics Hardware 2007, pages 97–106, August 2007. Best Paper Award. [ bib | DOI | http ]

Kwan-Liu Ma, Robert Ross, Jian Huang, Greg Humphreys, Nelson Max, Kenneth Moreland, John D. Owens, and Han-Wei Shen. Ultra-Scale Visualization: Research and Education. Journal of Physics: Conference Series, 78:012088 (6pp), June 2007. [ bib | DOI | http ]

John D. Owens. Towards Multi-GPU Support for Visualization. Journal of Physics: Conference Series, 78:012055 (5pp), June 2007. [ bib | DOI | http ]

John D. Owens, David Luebke, Naga Govindaraju, Mark Harris, Jens Krüger, Aaron E. Lefohn, and Tim Purcell. A Survey of General-Purpose Computation on Graphics Hardware. Computer Graphics Forum, 26(1):80–113, March 2007. [ bib | DOI | http ]

John D. Owens. The Installation and Use of OpenType Fonts in LATEX. TUGboat: Communications of the TEX Users Group, 27(2):112–118, December 2006. [ bib | http ]

Adam Moerschell and John D. Owens. Distributed Texture Memory in a Multi-GPU Environment. In Graphics Hardware 2006, pages 31–38, September 2006. [ bib | DOI | ACM DL | http ]

Leo Szumel and John D. Owens. The Virtual Pheromone Communication Primitive. In Phillip B. Gibbons, Tarek Abdelzaher, James Aspnes, and Ramesh Rao, editors, Proceedings of the Second IEEE International Conference on Distributed Computing in Sensor Systems, volume 4026 of Lecture Notes in Computer Science, pages 135–149. Springer, June 2006. [ bib | DOI | http ]

Aaron E. Lefohn, Shubhabrata Sengupta, Joe Kniss, Robert Strzodka, and John D. Owens. Glift: Generic Data Structures for the GPU. In Proceedings of the 2006 Workshop on Edge Computing Using New Commodity Architectures, pages D–15–16, May 2006. [ bib | http ]

Shubhabrata Sengupta, Aaron E. Lefohn, and John D. Owens. A Work-Efficient Step-Efficient Prefix Sum Algorithm. In Proceedings of the 2006 Workshop on Edge Computing Using New Commodity Architectures, pages D–26–27, May 2006. [ bib | http ]

Sung W. Park, Lars Linsen, Oliver Kreylos, John D. Owens, and Bernd Hamann. Discrete Sibson Interpolation. IEEE Transactions on Visualization and Computer Graphics, 12(2):243–253, March/April 2006. [ bib | DOI | http ]

Aaron E. Lefohn, Joe Kniss, Robert Strzodka, Shubhabrata Sengupta, and John D. Owens. Glift: Generic, Efficient, Random-Access GPU Data Structures. ACM Transactions on Graphics, 25(1):60–99, January 2006. [ bib | DOI | ACM DL | http ]

Sung Park, Lars Linsen, Oliver Kreylos, John D. Owens, and Bernd Hamann. A Framework for Real-Time Volume Visualization of Streaming Scattered Data. In Proceedings of the Tenth International Fall Workshop on Vision, Modeling, and Visualization, pages 225–232, November 2005. [ bib | http ]

Joe Kniss, Aaron Lefohn, Shubhabrata Sengupta, Robert Strzodka, and John D. Owens. Octree Textures on Graphics Hardware. In Technical Sketches Program, ACM SIGGRAPH 2005, August 2005. [ bib | DOI | ACM DL | http ]

Aaron Lefohn, Shubhabrata Sengupta, Joe Kniss, Robert Strzodka, and John D. Owens. Dynamic Adaptive Shadow Maps on Graphics Hardware. In Technical Sketches Program, ACM SIGGRAPH 2005, August 2005. [ bib | DOI | ACM DL | http ]

John D. Owens, David Luebke, Naga Govindaraju, Mark Harris, Jens Krüger, Aaron E. Lefohn, and Tim Purcell. A Survey of General-Purpose Computation on Graphics Hardware. In Eurographics 2005, State of the Art Reports, pages 21–51, August 2005. [ bib | http ]

Leo Szumel, Jason LeBrun, and John D. Owens. Towards a Mobile Agent Framework for Sensor Networks. In Proceedings of the Second IEEE Workshop on Embedded Networked Sensors (EmNetS-II), pages 79–87, May 2005. [ bib | DOI | .html ]

Aaron Lefohn, Joe Kniss, and John Owens. Implementing Efficient Parallel Data Structures on GPUs. In Matt Pharr, editor, GPU Gems 2, chapter 33, pages 521–545. Addison Wesley, March 2005. [ bib | http ]

John Owens. Streaming Architectures and Technology Trends. In Matt Pharr, editor, GPU Gems 2, chapter 29, pages 457–470. Addison Wesley, March 2005. [ bib | http ]

Andrew T. Riffel, Aaron E. Lefohn, Kiril Vidimce, Mark Leone, and John D. Owens. Mio: Fast Multipass Partitioning via Priority-Based Instruction Scheduling. In Graphics Hardware 2004, pages 35–44, August 2004. [ bib | DOI | ACM DL | http ]

Ujval J. Kapasi, Scott Rixner, William J. Dally, Brucek Khailany, Jung Ho Ahn, Peter Mattson, and John D. Owens. Programmable Stream Processors. IEEE Computer, 36(8):54–62, August 2003. [ bib | DOI | http ]

Brucek Khailany, William J. Dally, Scott Rixner, Ujval J. Kapasi, John D. Owens, and Brian Towles. Exploring the VLSI Scalability of Stream Processors. In Proceedings of the Ninth Annual International Symposium on High-Performance Computer Architecture, pages 153–164, February 2003. [ bib | DOI | http ]

Ujval J. Kapasi, William J. Dally, Brucek Khailany, John D. Owens, and Scott Rixner. The Imagine Stream Processor. In Proceedings of the IEEE International Conference on Computer Design, pages 282–288, Freiburg, Germany, September 2002. [ bib | DOI | http ]

John D. Owens, Brucek Khailany, Brian Towles, and William J. Dally. Comparing Reyes and OpenGL on a Stream Architecture. In Graphics Hardware 2002, pages 47–56, September 2002. [ bib | http ]

John D. Owens, Ujval J. Kapasi, Peter Mattson, Brian Towles, Ben Serebrin, Scott Rixner, and William J. Dally. Media Processing Applications on the Imagine Stream Processor. In Proceedings of the IEEE International Conference on Computer Design, pages 295–302, Freiburg, Germany, September 2002. [ bib | DOI | http ]

Ben Serebrin, John D. Owens, Brucek Khailany, Peter Mattson, Ujval J. Kapasi, Chen H. Chen, Jinyung Namkoong, Stephen P. Crago, Scott Rixner, and William J. Dally. A Stream Processor Development Platform. In Proceedings of the IEEE International Conference on Computer Design, pages 303–308, Freiburg, Germany, September 2002. [ bib | DOI | .pdf ]

Ujval J. Kapasi, Peter Mattson, William J. Dally, John D. Owens, and Brian Towles. Stream Scheduling. In Proceedings of the 3rd Workshop on Media and Streaming Processors, pages 101–106, Austin, TX, 2 December 2001. [ bib | http ]

Brucek Khailany, William J. Dally, Ujval J. Kapasi, Peter Mattson, Jinyung Namkoong, John D. Owens, Brian Towles, Andrew Chang, and Scott Rixner. Imagine: Media Processing with Streams. IEEE Micro, 21(2):35–46, March/April 2001. [ bib | DOI | http ]

Ujval J. Kapasi, William J. Dally, Scott Rixner, Peter R. Mattson, John D. Owens, and Brucek Khailany. Efficient Conditional Operations for Data-parallel Architectures. In Proceedings of the 33rd Annual ACM/IEEE International Symposium on Microarchitecture, pages 159–170, December 2000. [ bib | DOI | ACM DL | http ]

Peter Mattson, William J. Dally, Scott Rixner, Ujval J. Kapasi, and John D. Owens. Communication Scheduling. In Proceedings of the Ninth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 82–92, November 2000. [ bib | DOI | ACM DL | .pdf ]

Brucek Khailany, William J. Dally, Scott Rixner, Ujval J. Kapasi, Peter Mattson, Jin Namkoong, John D. Owens, and Brian Towles. Imagine: Signal and Image Processing Using Streams. In Hotchips 12, August 2000. [ bib | http ]

John D. Owens, William J. Dally, Ujval J. Kapasi, Scott Rixner, Peter Mattson, and Ben Mowery. Polygon Rendering on a Stream Architecture. In 2000 SIGGRAPH/Eurographics Workshop on Graphics Hardware, pages 23–32, August 2000. [ bib | DOI | ACM DL | http ]

Scott Rixner, William J. Dally, Ujval J. Kapasi, Peter Mattson, and John D. Owens. Memory Access Scheduling. In Proceedings of the 27th International Symposium on Computer Architecture, pages 128–138, June 2000. [ bib | DOI | ACM DL | .pdf ]

Scott Rixner, William J. Dally, Brucek Khailany, Peter Mattson, Ujval Kapasi, and John D. Owens. Register Organization for Media Processing. In Proceedings of the Sixth Annual International Symposium on High-Performance Computer Architecture, pages 375–386, January 2000. [ bib | DOI | .pdf ]

Scott Rixner, William J. Dally, Ujval J. Kapasi, Brucek Khailany, Abelardo Lopez-Lagunas, Peter Mattson, and John D. Owens. A Bandwidth-Efficient Architecture for Media Processing. In Proceedings of the 31st Annual ACM/IEEE International Symposium on Microarchitecture, pages 3–13, December 1998. [ bib | DOI | .pdf ]

Non-Refereed Publications

Anjul Patney, Stanley Tzeng, Kerry A. Seitz, Jr., and John D. Owens. Piko: A Design Framework for Programmable Graphics Pipelines. CoRR, abs/1404.6293(1404.6293v1), April 2014. [ bib | arXiv ]

Kerry A. Seitz, Jr., Alex Kennedy, Owen Ransom, Bassam A. Younis, and John D. Owens. A GPU Implementation for Two-Dimensional Shallow Water Modeling. CoRR, abs/1309.1230(1309.1230v1), September 2013. [ bib | arXiv ]

Stanley Tzeng and John D. Owens. Finding Convex Hulls Using Quickhull on the GPU. CoRR, abs/1201.2936(1201.2936v1), January 2012. [ bib | arXiv ]

Jeff A. Stuart and John D. Owens. Efficient Synchronization Primitives for GPUs. CoRR, abs/1110.4623(1110.4623v1), October 2011. [ bib | arXiv ]

Luke J. Gosink, Kesheng Wu, E. Wes Bethel, John D. Owens, and Kenneth I. Joy. Bin-Hash Indexing: A Parallel Method For Fast Query Processing. Technical Report LBNL-729E, Lawrence Berkeley National Laboratory, 20 August 2008. [ bib | .pdf ]

Michael Kass, Aaron Lefohn, and John Owens. Interactive Depth of Field Using Simulated Diffusion on a GPU. Technical Report #06-01, Pixar Animation Studios, January 2006. http://graphics.pixar.com/library/DepthOfField. [ bib | http ]

John D. Owens, Shubhabrata Sengupta, and Daniel Horn. Assessment of Graphic Processing Units (GPUs) for Department of Defense (DoD) Digital Signal Processing (DSP) Applications. Technical Report ECE-CE-2005-3, Department of Electrical and Computer Engineering, University of California, Davis, October 2005. http://www.ece.ucdavis.edu/cerl/techreports/2005-3/. [ bib | http ]

John D. Owens. GPUs tapped for general computing. EE Times, 13 December 2004. http://www.eet.com/news/latest/showArticle.jhtml?articleID=55300884. [ bib | http ]

John D. Owens. On The Scalability of Sensor Network Routing and Compression Algorithms. Technical Report ECE-CE-2004-1, Computer Engineering Research Laboratory, University of California, Davis, 2004. http://www.ece.ucdavis.edu/cerl/techreports/2004-1/. [ bib | http ]

Leo Szumel and John D. Owens. On the Feasibility of the UC Davis Metanet. Technical Report ECE-CE-2003-2, Computer Engineering Research Laboratory, University of California, Davis, 2003. http://www.ece.ucdavis.edu/cerl/techreports/2003-2/. [ bib | http ]

John D. Owens. Computer Graphics on a Stream Architecture. PhD thesis, Stanford University, November 2002. [ bib | http ]


This file was generated by bibtex2html 1.97.

---