John Owens / Electrical and Computer Engineering / UC Davis

John Owens's calculated h-index is 60. This page was automatically generated on 2026-07-21.
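
For reference, the h-index is the largest h such that h papers each have at least h citations. In the list below, the 60th entry has 61 citations and the 61st has 60, which gives h = 60; the blank line in the list falls at that cutoff. A minimal sketch of the calculation follows. The function name `h_index` and the sample data are illustrative assumptions, not the generator this page actually uses.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the paper at this rank still has enough citations
        else:
            break     # counts only decrease from here, so no larger h exists
    return h


# Hypothetical sample data, not taken from this page.
sample = [10, 8, 5, 4, 3]
print(h_index(sample))
```

Sorting in descending order lets the loop stop at the first rank whose citation count falls below the rank.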

3390	Owens:2007:ASO	A Survey of General-Purpose Computation on Graphics Hardware
3260	Owens:2008:GC	GPU Computing
2677	Liu:2020:EOD	Energy-based Out-of-distribution Detection
1362	Rixner:2000:MAS	Memory Access Scheduling
1224	Harris:2007:PPS	Parallel Prefix Sum (Scan) with CUDA
856	Sengupta:2007:SPF	Scan Primitives for GPU Computing
783	Wang:2016:GAH	Gunrock: A High-Performance Graph Processing Library on the GPU
637	Owens:2007:RCF	Research Challenges for On-Chip Interconnection Networks
491	Khailany:2001:IMP	Imagine: Media Processing with Streams
443	Kapasi:2003:PSP	Programmable Stream Processors
409	Zhang:2011:AQP	A Quantitative Performance Analysis Model for GPU Architectures
390	Rixner:2000:ROF	Register Organization for Media Processing
369	Kepner:2016:MFO	Mathematical Foundations of the GraphBLAS
361	Rixner:1998:ABA	A Bandwidth-Efficient Architecture for Media Processing
360	Kapasi:2002:TIS	The Imagine Stream Processor
347	Zhang:2010:FTS	Fast Tridiagonal Solvers on the GPU
336	Gupta:2012:ASO	A Study of Persistent Threads Style GPU Programming for GPGPU Workloads
286	Alcantara:2009:RPH	Real-Time Parallel Hashing on the GPU
279	Davidson:2014:WPG	Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths
276	Stuart:2011:MMO	Multi-GPU MapReduce on GPU Clusters
227	Lefohn:2006:GGE	Glift: Generic, Efficient, Random-Access GPU Data Structures
204	Wang:2017:GGG	Gunrock: GPU Graph Analytics
189	Yang:2018:DPF	Design Principles for Sparse Matrix Multiplication on the GPU
182	Yang:2022:GAH	GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
180	Owens:2005:SAA	Streaming Architectures and Technology Trends
178	Tzeng:2010:TMF	Task Management for Irregular-Parallel Workloads on the GPU
170	Muyan-Ozcelik:2008:FDR	Fast Deformable Registration on the GPU: A CUDA Implementation of Demons
164	Park:2006:DSI	Discrete Sibson Interpolation
154	Patel:2012:PLD	Parallel Lossless Data Compression on the GPU
154	Silberstein:2008:ECO	Efficient Computation of Sum-products on GPUs Through Software-Managed Cache
148	Samant:2008:HPC	High performance computing for deformable image registration: Towards a new paradigm in adaptive radiotherapy
146	Ebeida:2012:ASA	A Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions
146	Ebeida:2011:EMP	Efficient Maximal Poisson-Disk Sampling
144	Kapasi:2000:ECO	Efficient Conditional Operations for Data-parallel Architectures
139	Kass:2006:IDO	Interactive Depth of Field Using Simulated Diffusion on a GPU
127	Sengupta:2006:AWS	A Work-Efficient Step-Efficient Prefix Sum Algorithm
121	Davidson:2011:AAM	An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU
119	Owens:2002:MPA	Media Processing Applications on the Imagine Stream Processor
114	Ashkiani:2018:ADH	A Dynamic Hash Table for the GPU
106	Lefohn:2007:RSM	Resolution-Matched Shadow Maps
103	Davidson:2012:EPM	Efficient Parallel Merge Sort for Fixed and Variable Length Keys
102	Pan:2017:MGA	Multi-GPU Graph Analytics
102	Stuart:2009:MPO	Message Passing on Data-Parallel Architectures
99	Phillips:2009:RAP	Rapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units
97	Owens:2000:PRO	Polygon Rendering on a Stream Architecture
87	Alcantara:2011:BAE	Building an Efficient Hash Table on the GPU
82	Stuart:2010:MVR	Multi-GPU Volume Rendering using MapReduce
78	Awad:2019:EAH	Engineering a High-Performance GPU B-Tree
76	Wang:2016:ACS	A Comparative Study on Exact Triangle Counting Algorithms on the GPU
75	Budge:2009:ODM	Out-of-core Data Management for Path Tracing on Hybrid Resources
75	Patney:2008:RRA	Real-Time Reyes-Style Adaptive Surface Subdivision
73	Khailany:2003:ETV	Exploring the VLSI Scalability of Stream Processors
71	Jenkins:2011:LLF	Lessons Learned from Exploring the Backtracking Paradigm on the GPU
71	Stuart:2011:ESP	Efficient Synchronization Primitives for GPUs
70	Abdelkader:2020:VVM	VoroCrust: Voronoi Meshing Without Clipping
70	Kapasi:2001:SS	Stream Scheduling
68	Mattson:2000:CS	Communication Scheduling
66	Davidson:2012:TTF	Toward Techniques for Auto-tuning GPU Algorithms
65	Patney:2009:PVT	Parallel View-Dependent Tessellation of Catmull-Clark Subdivision Surfaces
61	Davidson:2011:RPF	Register Packing for Cyclic Reduction: A Case Study

60	Owens:2002:CGO	Computer Graphics on a Stream Architecture
58	Lefohn:2005:IEP	Implementing Efficient Parallel Data Structures on GPUs
56	Moerschell:2008:DTM	Distributed Texture Memory in a Multi-GPU Environment
54	Osama:2023:SWP:poster	Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU
54	Yang:2015:FSM	Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU
54	Szumel:2005:TAM	Towards a Mobile Agent Framework for Sensor Networks
54	Osama:2023:SWP	Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU
53	Yang:2018:IPE	Implementing Push-Pull Efficiently in GraphBLAS
47	Tzeng:2012:AGT	A GPU Task-Parallel Model with Dependency Resolution
46	Awad:2020:DGO	Dynamic Graphs on the GPU
45	Lin:2019:BDL	Benchmarking Deep Learning Frameworks and Investigating FPGA Deployment for Traffic Sign Classification and Detection
45	Ebeida:2011:EAG	Efficient and Good Delaunay Meshes From Random Points
44	Geil:2018:QFA	Quotient Filters: Approximate Membership Queries on the GPU
43	Ebeida:2011:ICR	Isotropic conforming refinement of quadrilateral and hexahedral meshes using two-refinement templates
41	Owens:2002:CRA	Comparing Reyes and OpenGL on a Stream Architecture
40	Wu:2015:PCO	Performance Characterization of High-Level Programming Models for GPU Graph Analytics
39	Riffel:2004:MFM	Mio: Fast Multipass Partitioning via Priority-Based Instruction Scheduling
37	Ashkiani:2018:GLA	GPU LSM: A Dynamic Dictionary Data Structure for the GPU
37	Lefohn:2005:DAS	Dynamic Adaptive Shadow Maps on Graphics Hardware
33	Stone:2011:GPA	GPGPU parallel algorithms for structured-grid CFD codes
33	Stuart:2010:GC	GPU-to-CPU Callbacks
32	Odemuyiwa:2023:ASD	Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling
31	Lin:2022:BAP	Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
31	Osama:2019:GCO	Graph Coloring on the GPU
31	Owens:2005:AOG	Assessment of Graphic Processing Units (GPUs) for Department of Defense (DoD) Digital Signal Processing (DSP) Applications
30	Zhang:2011:AHM	A Hybrid Method for Solving Tridiagonal Systems on the GPU
30	Wang:2020:FGS	Fast Gunrock Subgraph Matching (GSM) on GPUs
30	Tzeng:2012:FCH	Finding Convex Hulls Using Quickhull on the GPU
29	Chen:2022:SIP	Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way
28	Glavtchev:2011:FSL	Feature-Based Speed Limit Sign Detection Using a Graphics Processing Unit
28	Stuart:2011:EMT	Extending MPI to Accelerators
27	Ashkiani:2016:GM	GPU Multisplit
27	Kniss:2005:OTO	Octree Textures on Graphics Hardware
26	Ashkiani:2017:GMA	GPU Multisplit: an extended study of a parallel algorithm
25	Chen:2022:AAT	Atos: A Task-Parallel GPU Scheduler for Graph Analytics
24	Awad:2023:AAI	Analyzing and Implementing GPU Hash Tables
24	Gosink:2009:DPB	Data Parallel Bin-Based Indexing for Answering Queries on Multi-Core Architectures
23	Mahmoud:2021:RAG	RXMesh: A GPU Mesh Data Structure
23	Wang:2015:FSA	Fast Parallel Suffix Array on the GPU
23	Zhang:2011:APE	A Parallel Error Diffusion Implementation on a GPU
22	Wang:2016:FPS	Fast Parallel Skew and Prefix-Doubling Suffix Array Construction on the GPU
22	Patney:2010:FCA	Fragment-Parallel Composite and Filter
22	Phillips:2010:UTS	Unsteady Turbulent Simulations on a Cluster of Graphics Processors
22	Park:2005:AFF	A Framework for Real-Time Volume Visualization of Streaming Scattered Data
21	Osama:2022:EOP	Essentials of Parallel Graph Analytics
21	Abdelkader:2018:SCF	Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm
21	Ebeida:2014:KDS	k-d Darts: Sampling by k-Dimensional Flat Searches
21	Gupta:2009:TOF	Three-Layer Optimizations for Fast GMM Computations on GPU-like Parallel Processors
20	Serebrin:2002:ASP	A Stream Processor Development Platform
19	Muyan-Ozcelik:2010:ATA	A Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System using GPU Computing
18	Wang:2019:ADI	Accelerating DNN Inference with GraphBLAS and the GPU
18	Pan:2018:SBS	Scalable Breadth-First Search on a GPU Cluster
18	Abdelkader:2017:ACR	A Constrained Resampling Strategy for Mesh Improvement
18	Gosink:2008:BIA	Bin-Hash Indexing: A Parallel Method For Fast Query Processing
17	Muyan-Ozcelik:2011:RSR	Real-Time Speed-Limit-Sign Recognition on an Embedded System Using a GPU
17	Ma:2007:UVR	Ultra-Scale Visualization: Research and Education
16	Awad:2022:AGM	A GPU Multiversion B-Tree
15	Gupta:2011:CAM	Compute & Memory Optimizations for High-Quality Speech Recognition on Low-End GPU Processors
15	Szumel:2006:TVP	The Virtual Pheromone Communication Primitive
15	Khailany:2000:ISA	Imagine: Signal and Image Processing Using Streams
14	Osama:2023:APM	A Programming Model for GPU Load Balancing
14	Seitz:2019:SMF	Staged Metaprogramming for Shader System Development
13	Yih:2018:FVG	FPGA versus GPU for Speed-Limit-Sign Recognition
13	Geil:2014:WGC	WTF, GPU! Computing Twitter's Who-To-Follow on the GPU
12	Muyan-Ozcelik:2016:MRE	Multitasking Real-time Embedded GPU Computing Tasks
12	Ebeida:2013:SD	Sifted Disks
12	Zhang:2012:PDE	Plane-dependent Error Diffusion on a GPU
12	Odemuyiwa:2024:TEL	The EDGE Language: Extended General Einsums for Graph Algorithms
11	Seitz:2022:SUS	Supporting Unified Shader Specialization by Co-opting C++ Features
11	Wang:2019:FBT	Fast BFS-Based Triangle Counting on GPUs
10	Ebeida:2016:DDT	Disk Density Tuning of a Maximal Random Packing
9	Ashkiani:2016:PAT	Parallel Approaches to the String Matching Problem on the GPU
9	Liu:2018:OLA	Object Localization and Motion Transfer learning with Capsules
8	Lin:2025:TUP	Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
6	Ebeida:2014:EIH	Exercises in High-Dimensional Sampling: Maximal Poisson-disk Sampling and k-d Darts
6	Owens:2007:TMS	Towards Multi-GPU Support for Visualization
6	Owens:2004:GTF	GPUs tapped for general computing
5	Wapman:2023:HCA	Harmonic CUDA: Asynchronous Programming on GPUs
5	Brock:2019:RVR	RDMA vs. RPC for Implementing Distributed Data Structures
5	Abdelkader:2018:VIT	VoroCrust Illustrated: Theory and Challenges (Multimedia Exposition)
5	Weber:2015:PRA	Parallel Reyes-style Adaptive Subdivision with Bounded Memory Usage
4	Mahmoud:2025:DMP	Dynamic Mesh Processing on the GPU
4	Lin:2018:BDL	Benchmarking Deep Learning Frameworks with FPGA-suitable Models on a Traffic Sign Dataset
4	Mak:2014:GAE	GPU-Accelerated and Efficient Multi-View Triangulation for Scene Reconstruction
4	Phillips:2011:AO2	Acceleration of 2-D Compressible Flow Solvers with Graphics Processing Unit Clusters
4	Shinn:2023:TSR	The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
3	Muyan-Ozcelik:2017:MFM	Methods for Multitasking among Real-time Embedded Compute Tasks Running on the GPU
3	Gegan:2016:RGT	Real-Time GPU-based Timing Channel Detection using Entropy
3	Drescher:2023:BAP	BOBA: A Parallel Lightweight Graph Reordering Algorithm with Heavyweight Implications
2	Yuan:2026:BDB	BLASST: Dynamic BLocked Attention Sparsity via Softmax Thresholding
2	Smith:2025:PAT	Decoupled Fallback: A Portable Single-Pass GPU Scan
2	Owens:2018:TPG	Technical Perspective: Graphs, Betweenness Centrality, and the GPU
2	Wang:2017:MAL	Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU
2	Kemal:2016:MSA	Multidisciplinary simulation acceleration using multiple shared memory graphical processing units
2	Silberstein:2011:ASC	Applying Software-Managed Caching and CPU/GPU Task Scheduling for Accelerating Dynamic Workloads
2	Liu:2019:UOS	Unsupervised Object Segmentation with Explicit Localization Module
2	Seitz:2013:AGI	A GPU Implementation for Two-Dimensional Shallow Water Modeling
2	Owens:2004:OTS	On The Scalability of Sensor Network Routing and Compression Algorithms
2	Szumel:2003:OTF	On the Feasibility of the UC Davis Metanet
1	Geil:2023:MCE	Maximum Clique Enumeration on the GPU
1	Owens:2006:TIA	The Installation and Use of OpenType Fonts in LaTeX
1	Odemuyiwa:2026:MEF	Mambalaya: Einsum-Based Fusion Optimizations on State-Space Models
1	Shojaei:2025:MA	MLPerf Automotive
