Search results
692 items matching your criteria.

 - Efficient Implementation of OpenMP for Clusters with Implicit Data Distribution [1%] by admin, 2007-12-10 21:05
 - Dragon: A Static and Dynamic Tool for OpenMP [1%] by admin, 2007-12-10 21:05
 - Selected Papers from the First European Workshop on OpenMP (EWOMP) [1%] by admin, 2007-12-10 21:05
 - Extensions d’OpenMP pour les Architectures Hierarchie Memoire Multiple [1%] by admin, 2007-12-10 21:05
 - Performance Oriented Programming for NUMA Architectures [1%] by admin, 2007-12-10 21:05
 - OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, ...
 - Program Development Environment for OpenMP on ccNUMA and NUMA Platforms [1%] by admin, 2007-12-10 21:05
 - OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, ...
 - Achieving Performance under OpenMP on ccNUMA and Software Distributed Shared Memory Systems [1%] by admin, 2007-12-10 21:05
 - OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. It was conceived to enable easy, portable application ...
 - A Tool to Display Array Access Patterns in OpenMP Programs [1%] by admin, 2007-12-10 21:05
 - OpenMP is a de facto standard for shared memory programming that can be used to program SMPs and distributed shared memory systems. One way to improve OpenMP ...
 - Parallel Application Development with the Hybrid MPI+OpenMP Programming Model [1%] by admin, 2007-12-10 21:05
 - Many parallel platforms in use today are clusters of SMP systems, connected by Ethernet or one of the high-speed networks available. Some provide global memory ...
 - Performance Comparisons of Basic OpenMP Constructs [1%] by admin, 2007-12-10 21:05
 - OpenMP has become the de facto standard for shared memory parallel programming. The directive-based nature of OpenMP allows incremental and portable ...
 - Parallel Programming Models in the age of Globalization [1%] by admin, 2007-12-10 21:05
 - Implementing OpenMP using Dataflow execution Model for Data Locality and Efficient Parallel Execution [1%] by admin, 2007-12-10 21:05
 - In this paper, we show the potential benefits of translating OpenMP code to low-level parallel code using a data flow execution model, instead of targeting it ...
 - Cougar: Interactive Tool for Cluster Computing [1%] by admin, 2007-12-10 21:05
 - Improving the Performance of OpenMP by Array Privatization [1%] by admin, 2007-12-10 21:05
 - The scalability of an OpenMP program in a ccNUMA system with a large number of processors suffers from remote memory accesses, cache misses and false sharing. ...
 - Selected papers on OpenMP (including papers from the 2001 European Workshop on OpenMP) [1%] by admin, 2007-12-10 21:05
 - Asynchronous Execution of OpenMP Code [1%] by admin, 2007-12-10 21:05
 - Analyses and Optimizations for the Translation of OpenMP Codes into SPMD Style [1%] by admin, 2007-12-10 21:05
 - A so-called SPMD style OpenMP program can achieve scalability on ccNUMA systems by means of array privatization, and earlier research has shown good ...
 - Dragon: An Open64-Based Interactive Program Analysis Tool for Large Applications [1%] by admin, 2007-12-10 21:05
 - A program analysis tool can play an important role in helping users understand and improve large application codes. Dragon is a robust interactive program ...
 - Executing OpenMP on Distributed Memory Systems via Global Arrays [1%] by admin, 2007-12-10 21:05
 - This paper presents a source-to-source translation strategy from OpenMP to Global Arrays in order to extend OpenMP to distributed memory systems. This ...
 - Automatic Tuning of Whole Applications Using Direct Search and a Performance-based Transformation System [1%] by admin, 2007-12-10 21:05
 - In many cases, simple analytical models used by traditional compilers are no longer able to yield effectively optimized code for complex programs because of ...
 - Improving Performance by Reducing the Memory Footprint of Scientific Applications [1%] by admin, 2007-12-10 21:05
 - Over the last two decades, processor speeds have improved much faster than memory speeds. As a result, memory access delay is a major performance bottleneck in ...
 - Design and Implementation of Adifor90 [1%] by admin, 2007-12-10 21:05
 - In order to accurately and efficiently compute derivatives, many scientists are abandoning divided differences in favor of Automatic Differentiation (AD). ...
 - An Evaluation of Global Address Space Languages: Co-Array Fortran and Unified Parallel C [1%] by admin, 2007-12-10 21:05
 - Co-array Fortran (CAF) and Unified Parallel C (UPC) are two emerging languages for single-program, multiple-data global address space programming. These ...
 - Effective communication coalescing for data-parallel applications [1%] by admin, 2007-12-10 21:05
 - Communication coalescing is a static optimization that can reduce both communication frequency and redundant data transfer in compiler-generated code for ...
 - Online Critical Path Profiling for Parallel Applications [1%] by admin, 2007-12-10 21:05
 - Scalarization on Short Vector Machines [1%] by admin, 2007-12-10 21:05
 - Scalarization is a process that converts array statements into loop nests so that they can run on a scalar machine. One technical difficulty of scalarization ...
 - Scalarization Using Loop Alignment and Loop Skewing [1%] by admin, 2007-12-10 21:05
 - Array syntax, which is supported in many technical programming languages, adds expressive power by allowing operations on and assignments to whole arrays and ...
 - Parameterizing Loop Fusion for Automated Empirical Tuning [1%] by admin, 2007-12-10 21:05
 - Traditional compilers are limited in their ability to optimize applications for different architectures because statically modeling the effect of specific ...
 - Automatic blocking of QR and LU factorizations for locality [1%] by admin, 2007-12-10 21:05
 - Automatic Search for Patterns of Inefficient Behavior in Parallel Applications [1%] by admin, 2007-12-10 21:05
 - Event tracing is a powerful method of analyzing the performance behavior of parallel applications. Because event traces record the temporal and spatial ...