Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

How to reach maximum theoretical performance in solving linear equations systems on FPS Architecture 38/64 bits

Conference Paper
Publication Date:
1987
abstract:
A technique for dense linear system solution is presented which reaches maximum performances on attached processors like FPS-120, 5000 and X64 using the Fortran language with calls to the vector routines. Starting from the Dongarra's LU factorization algorithm the key idea is to carry out a pseudo-transposition of the lower triangular matrix L (including the main diagonal) around the minor diagonal. The pseudo-transposition allows to carry out all the matrix vector operations involved in LU factorization with only stride 1 dot product operations which, using the TM Auxiliary Memory and the TMDOT routine, can be executed in the FPS processor obtaining the maximum speed. Since the algorithm uses only vector instructions it is fully portable on all the FPS 38/64 bit machines and in general on all the vector computers with a similar memory structure. Furthermore the algorithm can be easily translated into the new FORTRAN 8X, which will probably become the standard for future SIMD computers for numerical applications.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
solution of linear equations; optimized algorithms; vector processors; FPS architecttures; performance evaluation
List of contributors:
Morgavi, Giovanna; Marconi, Lucia; Martini, Claudio; Rolando, Claudia; Corana, Angelo
Handle:
https://iris.cnr.it/handle/20.500.14243/201227
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)