--- ---

John Owens's calculated h-index is 37. This page was automatically generated on 2014-05-23.

1976Owens:2007:ASOA Survey of General-Purpose Computation on Graphics Hardware
1041Owens:2008:GCGPU Computing
550Rixner:2000:MASMemory Access Scheduling
443Sengupta:2007:SPFScan Primitives for GPU Computing
439Harris:2007:PPSParallel Prefix Sum (Scan) with CUDA
368Khailany:2001:IMPImagine: Media Processing with Streams
337Kapasi:2003:PSPProgrammable Stream Processors
331Owens:2007:RCFResearch Challenges for On-Chip Interconnection Networks
331Rixner:2000:ROFRegister Organization for Media Processing
296Rixner:1998:ABAA Bandwidth-Efficient Architecture for Media Processing
251Kapasi:2002:TISThe Imagine Stream Processor
152Lefohn:2006:GGEGlift: Generic, Efficient, Random-Access GPU Data Structures
138Owens:2005:SAAStreaming Architectures and Technology Trends
107Kapasi:2000:ECOEfficient Conditional Operations for Data-parallel Architectures
104Zhang:2011:AQPA Quantitative Performance Analysis Model for GPU Architectures
102Silberstein:2008:ECOEfficient Computation of Sum-products on GPUs Through Software-Managed Cache
100Zhang:2010:FTSFast Tridiagonal Solvers on the GPU
99Owens:2002:MPAMedia Processing Applications on the Imagine Stream Processor
88Samant:2008:HPCHigh performance computing for deformable image registration: Towards a new paradigm in adaptive radiotherapy
79Muyan-Ozcelik:2008:FDRFast Deformable Registration on the GPU: A CUDA Implementation of Demons
77Kass:2006:IDOInteractive Depth of Field Using Simulated Diffusion on a GPU
74Alcantara:2009:RPHReal-Time Parallel Hashing on the GPU
74Owens:2000:PROPolygon Rendering on a Stream Architecture
63Khailany:2003:ETVExploring the VLSI Scalability of Stream Processors
57Park:2006:DSIDiscrete Sibson Interpolation
55Sengupta:2006:AWSA Work-Efficient Step-Efficient Prefix Sum Algorithm
54Mattson:2000:CSCommunication Scheduling
52Kapasi:2001:SSStream Scheduling
51Lefohn:2007:RSMResolution-Matched Shadow Maps
48Stuart:2011:MMOMulti-GPU MapReduce on GPU Clusters
47Stuart:2009:MPOMessage Passing on Data-Parallel Architectures
45Tzeng:2010:TMFTask Management for Irregular-Parallel Workloads on the GPU
45Phillips:2009:RAPRapid Aerodynamic Performance Prediction on a Cluster of Graphics Processing Units
43Szumel:2005:TAMTowards a Mobile Agent Framework for Sensor Networks
42Patney:2008:RRAReal-Time Reyes-Style Adaptive Surface Subdivision
41Lefohn:2005:IEPImplementing Efficient Parallel Data Structures on GPUs
39Moerschell:2008:DTMDistributed Texture Memory in a Multi-GPU Environment

33Owens:2002:CRAComparing Reyes and OpenGL on a Stream Architecture
32Ebeida:2011:EMPEfficient Maximal Poisson-Disk Sampling
31Budge:2009:ODMOut-of-core Data Management for Path Tracing on Hybrid Resources
31Lefohn:2005:DASDynamic Adaptive Shadow Maps on Graphics Hardware
28Riffel:2004:MFMMio: Fast Multipass Partitioning via Priority-Based Instruction Scheduling
28Patney:2009:PVTParallel View-Dependent Tessellation of Catmull-Clark Subdivision Surfaces
27Stuart:2010:MVRMulti-GPU Volume Rendering using MapReduce
24Davidson:2011:AAMAn Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU
22Owens:2005:AOGAssessment of Graphic Processing Units (GPUs) for Department of Defense (DoD) Digital Signal Processing (DSP) Applications
20Ebeida:2012:ASAA Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions
19Park:2005:AFFA Framework for Real-Time Volume Visualization of Streaming Scattered Data
18Sengupta:2011:EPSEfficient Parallel Scan Algorithms for many-core GPUs
17Gupta:2012:ASOA Study of Persistent Threads Style GPU Programming for GPGPU Workloads
16Kniss:2005:OTOOctree Textures on Graphics Hardware
15Serebrin:2002:ASPA Stream Processor Development Platform
13Davidson:2012:TTFToward Techniques for Auto-tuning GPU Algorithms
13Patel:2012:PLDParallel Lossless Data Compression on the GPU
13Davidson:2010:TTFToward Techniques for Auto-Tuning GPU Algorithms
12Davidson:2011:RPFRegister Packing for Cyclic Reduction: A Case Study
12Phillips:2010:UTSUnsteady Turbulent Simulations on a Cluster of Graphics Processors
11Szumel:2006:TVPThe Virtual Pheromone Communication Primitive
11Gosink:2009:DPBData Parallel Bin-Based Indexing for Answering Queries on Multi-Core Architectures
11Stuart:2011:ESPEfficient Synchronization Primitives for GPUs
11Davidson:2012:EPMEfficient Parallel Merge Sort for Fixed and Variable Length Keys
11Gupta:2009:TOFThree-Layer Optimizations for Fast GMM Computations on GPU-like Parallel Processors
10Ma:2007:UVRUltra-Scale Visualization: Research and Education
10Khailany:2000:ISAImagine: Signal and Image Processing Using Streams
10Stuart:2010:GCGPU-to-CPU Callbacks
10Stone:2011:GPAGPGPU parallel algorithms for structured-grid CFD codes
9Ebeida:2011:EAGEfficient and Good Delaunay Meshes From Random Points
9Jenkins:2011:LLFLessons Learned from Exploring the Backtracking Paradigm on the GPU
9Muyan-Ozcelik:2010:ATAA Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System using GPU Computing
9Alcantara:2011:BAEBuilding an Efficient Hash Table on the GPU
8Patney:2010:FCAFragment-Parallel Composite and Filter
8Tzeng:2012:AGTA GPU Task-Parallel Model with Dependency Resolution
7Stuart:2011:EMTExtending MPI to Accelerators
7Glavtchev:2011:FSLFeature-Based Speed Limit Sign Detection Using a Graphics Processing Unit
7Ebeida:2011:ICRIsotropic conforming refinement of quadrilateral and hexahedral meshes using two-refinement templates
5Zhang:2011:AHMA Hybrid Method for Solving Tridiagonal Systems on the GPU
4Owens:2004:GTFGPUs tapped for general computing
4Tzeng:2012:FCHFinding Convex Hulls Using Quickhull on the GPU
4Tzeng:2012:HPDHigh-Quality Parallel Depth-of-Field Using Line Samples
3Li:2012:KOTkANN on the GPU with Shifted Sorting
3Zhang:2011:APEA Parallel Error Diffusion Implementation on a GPU
2Gupta:2011:CAMCompute \& Memory Optimizations for High-Quality Speech Recognition on Low-End GPU Processors
1Zhang:2012:PDEPlane-dependent Error Diffusion on a GPU
1Szumel:2003:OTFOn the Feasibility of the UC Davis Metanet

---