Search results
Did you not find what you were looking for? Try the Advanced Search for more precise search options.
692 items matching your criteria.
- Efficient Implementation of OpenMP for Clusters with Implicit Data Distribution [1%] by admin, 2007-12-10 21:05
- Dragon: A Static and Dynamic Tool for OpenMP [1%] by admin, 2007-12-10 21:05
- Selected Papers from the First European Workshop on OpenMP (EWOMP) [1%] by admin, 2007-12-10 21:05
- Extensions d’OpenMP pour les Architectures Hierarchie Memoire Multiple [1%] by admin, 2007-12-10 21:05
- Performance Oriented Programming for NUMA Architectures [1%] by admin, 2007-12-10 21:05
- OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, ...
- Program Development Environment for OpenMP on ccNUMA and NUMA Platforms [1%] by admin, 2007-12-10 21:05
- OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, ...
- Achieving Performance Under Openmp On Ccnuma And Software Distributed Shared Memory Systems [1%] by admin, 2007-12-10 21:05
- OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. It was conceived to enable easy, portable application ...
- A Tool to Display Array Access Patterns in OpenMP Programs [1%] by admin, 2007-12-10 21:05
- OpenMP is a de facto standard for shared memory programming that can be used to program SMPs and distributed shared memory systems. One way to improve OpenMP ...
- Parallel Application Development with the Hybrid MPI+OpenMP Programming Model [1%] by admin, 2007-12-10 21:05
- Many parallel platforms in use today are clusters of SMP systems, connected by Ethernet or one of the high-speed networks available. Some provide global memory ...
- Performance Comparisons of Basic OpenMP Constructs [1%] by admin, 2007-12-10 21:05
- OpenMP has become the de-facto standard for shared memory parallel programming. The directive based nature of OpenMP allows incremental and portable ...
- Parallel Programming Models in the age of Globalization [1%] by admin, 2007-12-10 21:05
- Implementing OpenMP using Dataflow execution Model for Data Locality and Efficient Parallel Execution [1%] by admin, 2007-12-10 21:05
- In this paper, we show the potential benefits of translating OpenMP code to low-level parallel code using a data flow execution model, instead of targeting it ...
- Cougar: Interactive Tool for Cluster Computing [1%] by admin, 2007-12-10 21:05
- Improving the Performance of OpenMP by Array Privatization [1%] by admin, 2007-12-10 21:05
- The scalability of an OpenMP program in a ccNUMA system with a large number of processors suffers from remote memory accesses, cache misses and false sharing. ...
- Selected papers on OpenMP (including papers from the 2001 European Workshop on OpenMP) [1%] by admin, 2007-12-10 21:05
- Asynchronous Execution of OpenMP Code [1%] by admin, 2007-12-10 21:05
- Analyses and Optimizations for the Translation of OpenMP Codes into SPMD Style [1%] by admin, 2007-12-10 21:05
- A so-called SPMD style OpenMP program can achieve scalability on ccNUMA systems by means of array privatization, and earlier research has shown good ...
- Dragon: An Open64-Based Interactive Program Analysis Tool for Large Applications [1%] by admin, 2007-12-10 21:05
- A program analysis tool can play an important role in helping users understand and improve large application codes. Dragon is a robust interactive program ...
- Executing OpenMP on Distributed Memory Systems via Global Arrays [1%] by admin, 2007-12-10 21:05
- This paper presents a source-to-source translation strategy from OpenMP to Global Arrays in order to extend OpenMP to distributed memory systems. This ...
- Automatic Tuning of Whole Applications Using Direct Search and a Performance-based Transformation System [1%] by admin, 2007-12-10 21:05
- In many cases, simple analytical models used by traditional compilers are no longer able to yield effectively optimized code for complex programs because of ...
- Improving Performance by Reducing the Memory Footprint of Scientific Applications [1%] by admin, 2007-12-10 21:05
- Over the last two decades, processor speeds have improved much faster than memory speeds. As a result, memory access delay is a major performance bottleneck in ...
- Design and Implementation of Adifor90 [1%] by admin, 2007-12-10 21:05
- In order to accurately and efficiently compute derivatives, many scientists and are abandoning divided differences in favor of Automatic Differentiation (AD). ...
- An Evaluation of Global Address Space Languages: Co-Array Fortran and Unified Parallel C [1%] by admin, 2007-12-10 21:05
- Co-array Fortran (CAF) and Unified Parallel C (UPC) are two emerging languages for single-program, multiple-data global address space programming. These ...
- Effective communication coalescing for data-parallel applications [1%] by admin, 2007-12-10 21:05
- Communication coalescing is a static optimization that can reduce both communication frequency and redundant data transfer in compiler-generated code for ...
- Online Critical Path Profiling for Parallel Applications [1%] by admin, 2007-12-10 21:05
- Scalarization on Short Vector Machines [1%] by admin, 2007-12-10 21:05
- Scalarization is a process that converts array statements into loop nests so that they can run on a scalar machine. One technical difficulty of scalarization ...
- Scalarization Using Loop Alignment and Loop Skewing [1%] by admin, 2007-12-10 21:05
- Array syntax, which is supported in many technical programming languages, adds expressive power by allowing operations on and assignments to whole arrays and ...
- Parameterizing Loop Fusion for Automated Empirical Tuning [1%] by admin, 2007-12-10 21:05
- Traditional compilers are limited in their ability to optimize applications for different architectures because statically modeling the effect of specific ...
- Automatic blocking of QR and LU factorizations for locality [1%] by admin, 2007-12-10 21:05
- Automatic Search for Patterns of Inefficient Behavior in Parallel Applications [1%] by admin, 2007-12-10 21:05
- Event tracing is a powerful method of analyzing the performance behavior of parallel applications. Because event traces record the temporal and spatial ...