Personal tools
You are here: Home

Search results

Did you not find what you were looking for? Try the Advanced Search for more precise search options.

692 items matching your criteria. RSS Feed
Efficient Implementation of OpenMP for Clusters with Implicit Data Distribution [1%] by admin, 2007-12-10 21:05
Dragon: A Static and Dynamic Tool for OpenMP [1%] by admin, 2007-12-10 21:05
Selected Papers from the First European Workshop on OpenMP (EWOMP) [1%] by admin, 2007-12-10 21:05
Extensions d’OpenMP pour les Architectures Hierarchie Memoire Multiple [1%] by admin, 2007-12-10 21:05
Performance Oriented Programming for NUMA Architectures [1%] by admin, 2007-12-10 21:05
OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, ...
Program Development Environment for OpenMP on ccNUMA and NUMA Platforms [1%] by admin, 2007-12-10 21:05
OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. Although it has also been implemented on ccNUMA architectures, ...
Achieving Performance Under Openmp On Ccnuma And Software Distributed Shared Memory Systems [1%] by admin, 2007-12-10 21:05
OpenMP is emerging as a viable high-level programming model for shared memory parallel systems. It was conceived to enable easy, portable application ...
A Tool to Display Array Access Patterns in OpenMP Programs [1%] by admin, 2007-12-10 21:05
OpenMP is a de facto standard for shared memory programming that can be used to program SMPs and distributed shared memory systems. One way to improve OpenMP ...
Parallel Application Development with the Hybrid MPI+OpenMP Programming Model [1%] by admin, 2007-12-10 21:05
Many parallel platforms in use today are clusters of SMP systems, connected by Ethernet or one of the high-speed networks available. Some provide global memory ...
Performance Comparisons of Basic OpenMP Constructs [1%] by admin, 2007-12-10 21:05
OpenMP has become the de-facto standard for shared memory parallel programming. The directive based nature of OpenMP allows incremental and portable ...
Parallel Programming Models in the age of Globalization [1%] by admin, 2007-12-10 21:05
Implementing OpenMP using Dataflow execution Model for Data Locality and Efficient Parallel Execution [1%] by admin, 2007-12-10 21:05
In this paper, we show the potential benefits of translating OpenMP code to low-level parallel code using a data flow execution model, instead of targeting it ...
Cougar: Interactive Tool for Cluster Computing [1%] by admin, 2007-12-10 21:05
Improving the Performance of OpenMP by Array Privatization [1%] by admin, 2007-12-10 21:05
The scalability of an OpenMP program in a ccNUMA system with a large number of processors suffers from remote memory accesses, cache misses and false sharing. ...
Selected papers on OpenMP (including papers from the 2001 European Workshop on OpenMP) [1%] by admin, 2007-12-10 21:05
Asynchronous Execution of OpenMP Code [1%] by admin, 2007-12-10 21:05
Analyses and Optimizations for the Translation of OpenMP Codes into SPMD Style [1%] by admin, 2007-12-10 21:05
A so-called SPMD style OpenMP program can achieve scalability on ccNUMA systems by means of array privatization, and earlier research has shown good ...
Dragon: An Open64-Based Interactive Program Analysis Tool for Large Applications [1%] by admin, 2007-12-10 21:05
A program analysis tool can play an important role in helping users understand and improve large application codes. Dragon is a robust interactive program ...
Executing OpenMP on Distributed Memory Systems via Global Arrays [1%] by admin, 2007-12-10 21:05
This paper presents a source-to-source translation strategy from OpenMP to Global Arrays in order to extend OpenMP to distributed memory systems. This ...
Automatic Tuning of Whole Applications Using Direct Search and a Performance-based Transformation System [1%] by admin, 2007-12-10 21:05
In many cases, simple analytical models used by traditional compilers are no longer able to yield effectively optimized code for complex programs because of ...
Improving Performance by Reducing the Memory Footprint of Scientific Applications [1%] by admin, 2007-12-10 21:05
Over the last two decades, processor speeds have improved much faster than memory speeds. As a result, memory access delay is a major performance bottleneck in ...
Design and Implementation of Adifor90 [1%] by admin, 2007-12-10 21:05
In order to accurately and efficiently compute derivatives, many scientists and are abandoning divided differences in favor of Automatic Differentiation (AD). ...
An Evaluation of Global Address Space Languages: Co-Array Fortran and Unified Parallel C [1%] by admin, 2007-12-10 21:05
Co-array Fortran (CAF) and Unified Parallel C (UPC) are two emerging languages for single-program, multiple-data global address space programming. These ...
Effective communication coalescing for data-parallel applications [1%] by admin, 2007-12-10 21:05
Communication coalescing is a static optimization that can reduce both communication frequency and redundant data transfer in compiler-generated code for ...
Online Critical Path Profiling for Parallel Applications [1%] by admin, 2007-12-10 21:05
Scalarization on Short Vector Machines [1%] by admin, 2007-12-10 21:05
Scalarization is a process that converts array statements into loop nests so that they can run on a scalar machine. One technical difficulty of scalarization ...
Scalarization Using Loop Alignment and Loop Skewing [1%] by admin, 2007-12-10 21:05
Array syntax, which is supported in many technical programming languages, adds expressive power by allowing operations on and assignments to whole arrays and ...
Parameterizing Loop Fusion for Automated Empirical Tuning [1%] by admin, 2007-12-10 21:05
Traditional compilers are limited in their ability to optimize applications for different architectures because statically modeling the effect of specific ...
Automatic blocking of QR and LU factorizations for locality [1%] by admin, 2007-12-10 21:05
Automatic Search for Patterns of Inefficient Behavior in Parallel Applications [1%] by admin, 2007-12-10 21:05
Event tracing is a powerful method of analyzing the performance behavior of parallel applications. Because event traces record the temporal and spatial ...
« September 2010 »
Su Mo Tu We Th Fr Sa
1234
567891011
12131415161718
19202122232425
2627282930
 

Powered by Plone

LACSI Collaborators include:

Rice University LANL UH UNM UIUC UNC UTK