John Owens / Electrical and Computer Engineering / UC Davis

John Owens's calculated h-index is 61. This page was automatically generated on 2025-12-29.

3433	Owens:2007:ASO	A Survey of General-Purpose Computation on Graphics Hardware
3221	Owens:2008:GC	GPU Computing
2102	Liu:2020:EOD	Energy-based Out-of-distribution Detection
1407	Rixner:2000:MAS	Memory Access Scheduling
1228	Harris:2007:PPS	Parallel Prefix Sum (Scan) with CUDA
873	Sengupta:2007:SPF	Scan Primitives for GPU Computing
741	Wang:2016:GAH	Gunrock: A High-Performance Graph Processing Library on the GPU
661	Owens:2007:RCF	Research Challenges for On-Chip Interconnection Networks
506	Khailany:2001:IMP	Imagine: Media Processing with Streams
450	Kapasi:2003:PSP	Programmable Stream Processors
416	Zhang:2011:AQP	A Quantitative Performance Analysis Model for GPU Architectures
414	Rixner:2000:ROF	Register Organization for Media Processing
369	Kapasi:2002:TIS	The Imagine Stream Processor
362	Rixner:1998:ABA	A Bandwidth-Efficient Architecture for Media Processing
359	Zhang:2010:FTS	Fast Tridiagonal Solvers on the GPU
349	Kepner:2016:MFO	Mathematical Foundations of the GraphBLAS
321	Gupta:2012:ASO	A Study of Persistent Threads Style GPU Programming for GPGPU Workloads
287	Stuart:2011:MMO	Multi-GPU MapReduce on GPU Clusters
285	Alcantara:2009:RPH	Real-Time Parallel Hashing on the GPU
275	Davidson:2014:WPG	Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths
228	Lefohn:2006:GGE	Glift: Generic, Efficient, Random-Access GPU Data Structures
193	Wang:2017:GGG	Gunrock: GPU Graph Analytics
180	Owens:2005:SAA	Streaming Architectures and Technology Trends
177	Tzeng:2010:TMF	Task Management for Irregular-Parallel Workloads on the GPU
173	Yang:2018:DPF	Design Principles for Sparse Matrix Multiplication on the GPU
172	Muyan-Ozcelik:2008:FDR	Fast Deformable Registration on the GPU: A CUDA Implementation of Demons
162	Park:2006:DSI	Discrete Sibson Interpolation
156	Yang:2022:GAH	GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
156	Silberstein:2008:ECO	Efficient Computation of Sum-products on GPUs Through Software-Managed Cache
152	Patel:2012:PLD	Parallel Lossless Data Compression on the GPU
149	Samant:2008:HPC	High performance computing for deformable image registration: Towards a new paradigm in adaptive radiotherapy
145	Kapasi:2000:ECO	Efficient Conditional Operations for Data-parallel Architectures
143	Kass:2006:IDO	Interactive Depth of Field Using Simulated Diffusion on a GPU
141	Ebeida:2011:EMP	Efficient Maximal Poisson-Disk Sampling
140	Ebeida:2012:ASA	A Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions
129	Owens:2002:MPA	Media Processing Applications on the Imagine Stream Processor
126	Davidson:2011:AAM	An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU
126	Sengupta:2006:AWS	A Work-Efficient Step-Efficient Prefix Sum Algorithm
113	Stuart:2009:MPO	Message Passing on Data-Parallel Architectures
104	Phillips:2009:RAP	Rapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units
103	Davidson:2012:EPM	Efficient Parallel Merge Sort for Fixed and Variable Length Keys
102	Ashkiani:2018:ADH	A Dynamic Hash Table for the GPU
101	Lefohn:2007:RSM	Resolution-Matched Shadow Maps
97	Owens:2000:PRO	Polygon Rendering on a Stream Architecture
94	Pan:2017:MGA	Multi-GPU Graph Analytics
89	Alcantara:2011:BAE	Building an Efficient Hash Table on the GPU
85	Stuart:2010:MVR	Multi-GPU Volume Rendering using MapReduce
77	Stuart:2011:ESP	Efficient Synchronization Primitives for GPUs
76	Kapasi:2001:SS	Stream Scheduling
75	Budge:2009:ODM	Out-of-core Data Management for Path Tracing on Hybrid Resources
75	Patney:2008:RRA	Real-Time Reyes-Style Adaptive Surface Subdivision
75	Khailany:2003:ETV	Exploring the VLSI Scalability of Stream Processors
73	Wang:2016:ACS	A Comparative Study on Exact Triangle Counting Algorithms on the GPU
72	Awad:2019:EAH	Engineering a High-Performance GPU B-Tree
71	Davidson:2011:RPF	Register Packing for Cyclic Reduction: A Case Study
71	Mattson:2000:CS	Communication Scheduling
70	Jenkins:2011:LLF	Lessons Learned from Exploring the Backtracking Paradigm on the GPU
67	Davidson:2012:TTF	Toward Techniques for Auto-tuning GPU Algorithms
65	Abdelkader:2020:VVM	VoroCrust: Voronoi Meshing Without Clipping
63	Patney:2009:PVT	Parallel View-Dependent Tessellation of Catmull-Clark Subdivision Surfaces
62	Owens:2002:CGO	Computer Graphics on a Stream Architecture

60	Moerschell:2008:DTM	Distributed Texture Memory in a Multi-GPU Environment
57	Lefohn:2005:IEP	Implementing Efficient Parallel Data Structures on GPUs
56	Szumel:2005:TAM	Towards a Mobile Agent Framework for Sensor Networks
52	Yang:2018:IPE	Implementing Push-Pull Efficiently in GraphBLAS
48	Yang:2015:FSM	Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU
48	Tzeng:2012:AGT	A GPU Task-Parallel Model with Dependency Resolution
44	Ebeida:2011:EAG	Efficient and Good Delaunay Meshes From Random Points
43	Ebeida:2011:ICR	Isotropic conforming refinement of quadrilateral and hexahedral meshes using two-refinement templates
42	Awad:2020:DGO	Dynamic Graphs on the GPU
41	Osama:2023:SWP:poster	Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU
41	Lin:2019:BDL	Benchmarking Deep Learning Frameworks and Investigating FPGA Deployment for Traffic Sign Classification and Detection
41	Wu:2015:PCO	Performance Characterization of High-Level Programming Models for GPU Graph Analytics
41	Owens:2002:CRA	Comparing Reyes and OpenGL on a Stream Architecture
41	Osama:2023:SWP	Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU
40	Geil:2018:QFA	Quotient Filters: Approximate Membership Queries on the GPU
40	Riffel:2004:MFM	Mio: Fast Multipass Partitioning via Priority-Based Instruction Scheduling
37	Lefohn:2005:DAS	Dynamic Adaptive Shadow Maps on Graphics Hardware
33	Ashkiani:2018:GLA	GPU LSM: A Dynamic Dictionary Data Structure for the GPU
33	Stuart:2010:GC	GPU-to-CPU Callbacks
32	Stone:2011:GPA	GPGPU parallel algorithms for structured-grid CFD codes
32	Stuart:2011:EMT	Extending MPI to Accelerators
32	Zhang:2011:AHM	A Hybrid Method for Solving Tridiagonal Systems on the GPU
31	Owens:2005:AOG	Assessment of Graphic Processing Units (GPUs) for Department of Defense (DoD) Digital Signal Processing (DSP) Applications
30	Osama:2019:GCO	Graph Coloring on the GPU
30	Tzeng:2012:FCH	Finding Convex Hulls Using Quickhull on the GPU
28	Glavtchev:2011:FSL	Feature-Based Speed Limit Sign Detection Using a Graphics Processing Unit
28	Kniss:2005:OTO	Octree Textures on Graphics Hardware
27	Lin:2022:BAP	Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
27	Ashkiani:2016:GM	GPU Multisplit
27	Patney:2015:PAF	Piko: A Framework for Authoring Programmable Graphics Pipelines
26	Odemuyiwa:2023:ASD	Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling
26	Ashkiani:2017:GMA	GPU Multisplit: an extended study of a parallel algorithm
26	Wang:2020:FGS	Fast Gunrock Subgraph Matching (GSM) on GPUs
25	Gosink:2009:DPB	Data Parallel Bin-Based Indexing for Answering Queries on Multi-Core Architectures
24	Wang:2015:FSA	Fast Parallel Suffix Array on the GPU
24	Gupta:2009:TOF	Three-Layer Optimizations for Fast GMM Computations on GPU-like Parallel Processors
23	Chen:2022:SIP	Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way
23	Zhang:2011:APE	A Parallel Error Diffusion Implementation on a GPU
23	Patney:2010:FCA	Fragment-Parallel Composite and Filter
22	Wang:2016:FPS	Fast Parallel Skew and Prefix-Doubling Suffix Array Construction on the GPU
22	Phillips:2010:UTS	Unsteady Turbulent Simulations on a Cluster of Graphics Processors
22	Park:2005:AFF	A Framework for Real-Time Volume Visualization of Streaming Scattered Data
21	Awad:2023:AAI	Analyzing and Implementing GPU Hash Tables
20	Chen:2022:AAT	Atos: A Task-Parallel GPU Scheduler for Graph Analytics
20	Osama:2022:EOP	Essentials of Parallel Graph Analytics
20	Mahmoud:2021:RAG	RXMesh: A GPU Mesh Data Structure
20	Ebeida:2014:KDS	$k$-d Darts: Sampling by $k$-Dimensional Flat Searches
20	Ma:2007:UVR	Ultra-Scale Visualization: Research and Education
20	Serebrin:2002:ASP	A Stream Processor Development Platform
20	Gosink:2008:BIA	Bin-Hash Indexing: A Parallel Method For Fast Query Processing
19	Abdelkader:2018:SCF	Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm
19	Muyan-Ozcelik:2010:ATA	A Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System using GPU Computing
18	Wang:2019:ADI	Accelerating DNN Inference with GraphBLAS and the GPU
18	Abdelkader:2017:ACR	A Constrained Resampling Strategy for Mesh Improvement
18	Muyan-Ozcelik:2011:RSR	Real-Time Speed-Limit-Sign Recognition on an Embedded System Using a GPU
17	Pan:2018:SBS	Scalable Breadth-First Search on a GPU Cluster
16	Gupta:2011:CAM	Compute \& Memory Optimizations for High-Quality Speech Recognition on Low-End GPU Processors
16	Szumel:2006:TVP	The Virtual Pheromone Communication Primitive
16	Khailany:2000:ISA	Imagine: Signal and Image Processing Using Streams
14	Awad:2022:AGM	A GPU Multiversion B-Tree
12	Osama:2023:APM	A Programming Model for GPU Load Balancing
12	Seitz:2019:SMF	Staged Metaprogramming for Shader System Development
12	Yih:2018:FVG	FPGA versus GPU for Speed-Limit-Sign Recognition
12	Muyan-Ozcelik:2016:MRE	Multitasking Real-time Embedded GPU Computing Tasks
12	Geil:2014:WGC	WTF, GPU! Computing Twitter's Who-To-Follow on the GPU
12	Ebeida:2013:SD	Sifted Disks
12	Zhang:2012:PDE	Plane-dependent Error Diffusion on a GPU
10	Seitz:2022:SUS	Supporting Unified Shader Specialization by Co-opting C++ Features
10	Wang:2019:FBT	Fast BFS-Based Triangle Counting on GPUs
10	Ebeida:2016:DDT	Disk Density Tuning of a Maximal Random Packing
9	Ashkiani:2016:PAT	Parallel Approaches to the String Matching Problem on the GPU
8	Odemuyiwa:2024:TEL	The EDGE Language: Extended General Einsums for Graph Algorithms
8	Liu:2018:OLA	Object Localization and Motion Transfer learning with Capsules
7	Owens:2007:TMS	Towards Multi-GPU Support for Visualization
6	Owens:2004:GTF	GPUs tapped for general computing
5	Abdelkader:2018:VIT	VoroCrust Illustrated: Theory and Challenges (Multimedia Exposition)
5	Weber:2015:PRA	Parallel Reyes-style Adaptive Subdivision with Bounded Memory Usage
5	Ebeida:2014:EIH	Exercises in High-Dimensional Sampling: Maximal Poisson-disk Sampling and $k$-d Darts
4	Lin:2025:TUP	Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
4	Brock:2019:RVR	RDMA vs.\ RPC for Implementing Distributed Data Structures
4	Lin:2018:BDL	Benchmarking Deep Learning Frameworks with FPGA-suitable Models on a Traffic Sign Dataset
4	Mak:2014:GAE	GPU-Accelerated and Efficient Multi-View Triangulation for Scene Reconstruction
4	Phillips:2011:AO2	Acceleration of 2-D Compressible Flow Solvers with Graphics Processing Unit Clusters
3	Muyan-Ozcelik:2017:MFM	Methods for Multitasking among Real-time Embedded Compute Tasks Running on the GPU
3	Gegan:2016:RGT	Real-Time GPU-based Timing Channel Detection using Entropy
3	Drescher:2023:BAP	BOBA: A Parallel Lightweight Graph Reordering Algorithm with Heavyweight Implications
2	Wapman:2023:HCA	Harmonic CUDA: Asynchronous Programming on GPUs
2	Owens:2018:TPG	Technical Perspective: Graphs, Betweenness Centrality, and the GPU
2	Wang:2017:MAL	Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU
2	Kemal:2016:MSA	Multidisciplinary simulation acceleration using multiple shared memory graphical processing units
2	Silberstein:2011:ASC	Applying Software-Managed Caching and CPU/GPU Task Scheduling for Accelerating Dynamic Workloads
2	Shinn:2023:TSR	The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks
2	Seitz:2013:AGI	A GPU Implementation for Two-Dimensional Shallow Water Modeling
2	Owens:2004:OTS	On The Scalability of Sensor Network Routing and Compression Algorithms
2	Szumel:2003:OTF	On the Feasibility of the UC Davis Metanet
1	Geil:2023:MCE	Maximum Clique Enumeration on the GPU
1	Owens:2006:TIA	The Installation and Use of OpenType Fonts in \LaTeX
1	Liu:2019:UOS	Unsupervised Object Segmentation with Explicit Localization Module

Navigate