Skip to content

Latest commit

 

History

History
33 lines (27 loc) · 1.64 KB

File metadata and controls

33 lines (27 loc) · 1.64 KB

HiCMA-PaRSEC Release Information

Current Version: 0.1.0

Major Features

  • Hierarchical Matrix Computations: Tile Low-Rank (TLR) Cholesky factorization
  • Mixed Precision Support: Double, single, half, and FP8 precision arithmetic
  • GPU Acceleration: CUDA and HIP support for heterogeneous computing
  • Adaptive Memory Management: Dynamic precision selection and memory allocation
  • PaRSEC Runtime Integration: Task-based parallel execution with automatic load balancing

Core Capabilities

  • Matrix Operations: Matrix multiplication, Cholesky factorization, rank-k updates
  • Sparse Matrix Analysis: Bandwidth optimization and memory usage estimation
  • Climate Modeling: Exascale climate emulator with spherical harmonic transforms
  • Scientific Computing: 3D mesh deformation, spatial statistics, genomics applications

Build System

  • Modern CMake: CMake 3.16+ with presets and modern policies
  • Cross-Platform Support: Linux, macOS, and Windows compatibility
  • Automated Testing: Comprehensive test suite with unit, integration, and performance tests
  • Docker Support: Containerized builds for different environments

Dependencies

  • DPLASMA: Distributed Parallel Linear Algebra Software
  • STARS-H: Hierarchical matrix generation and approximation
  • HCORE: Low-rank matrix operations and BLAS kernels

Documentation

  • Comprehensive Guides: Build, testing, and usage documentation
  • API Documentation: Detailed function and module documentation
  • Examples: Climate emulator and scientific computing examples
  • Project Structure: Detailed project organization documentation