TALK KEYWORD INDEX

This page contains an index consisting of author-provided keywords.

A | |

a. | |

Accelerator computing | |

Accelerators | |

Adaptive Optics | |

Address Clustering | |

address sampling | |

Affinity | |

AIOps | |

All-Substrings LCS | |

AllScale | |

AMPI | |

AMTs | |

Analytical model | |

Anomaly Diagnosis | |

Applied Mathematics | |

approximate computing | |

Approximation algorithm | |

Architecture-aware Pruning | |

Asymptotic performance | |

Asynchronous Parallelism | |

Asynchronous task execution | |

Asynchronous Tasking | |

Asynchrony | |

Aurora-SX | |

Author keywords: | |

auto-scheduling | |

Auto-tuning | |

Autonomous control | |

B | |

b. | |

Benchmark testing | |

benchmarking | |

Benford's Law | |

Bi-objective optimization | |

Big Data | |

Bioinformatics | |

BLAS | |

Blockchain | |

Blockchain Analysis | |

breadth-first search | |

Buffering | |

Burst buffers | |

C | |

C++ | |

c. | |

Cache-Oblivious | |

Capacity Planning | |

ccNUMA | |

CFD simulations | |

Charm++ | |

checkpointing | |

chip multiprocessor | |

Closing | |

Cloud | |

Cloud Computing | |

Cloud Scaling | |

clustering | |

CNN | |

Co-execution | |

Collaboration | |

Collaborative Computing | |

Collective behavior | |

communication delays | |

Communication Optimizations | |

Community engagement | |

compiler | |

Complexity | |

Computational Astronomy | |

computational model | |

Computational Science | |

Compute Continuum | |

Computing clouds | |

Concurrency | |

Conference | |

Containers | |

Contextual Bandits | |

Control theory | |

Cost distributions | |

Coughing and Sneezing simulations | |

COVID-19 | |

Covid-19 Diffusion | |

cross-chain communication | |

CUDA | |

CUDA Graph | |

D | |

data access queries | |

Data Analytics | |

Data Assimilation | |

data layout | |

data locality | |

data-parallel applications | |

debug registers | |

Decentralised Exchange | |

decentralized exchange | |

deep learning | |

Deep neural network | |

Deep Neural Networks | |

Deep Reinforcement Learning | |

Denial-of-service | |

Deployment | |

design patterns | |

Digital Twin | |

Directives | |

DISSECT-CF | |

Distributed Algorithm | |

Distributed computing | |

distributed e-business workflows | |

Distributed Machine Learning | |

Distributed memory systems | |

Distributed Shared Memory | |

Distributed Stream Processing | |

Distributed Systems | |

DNN | |

Domain decomposition | |

Domain Specific Language | |

Domain-specific runtime | |

DPC++ | |

E | |

EASY backfilling | |

Edge | |

Edge computing | |

efficient algorithms | |

Emerging hardware | |

Empirical evaluation | |

emulation | |

Energy Efficiency | |

Energy optimization | |

energy saving | |

Epidemiological simulation | |

Euro-Par 2021 | |

Euro-Par 2022 | |

Eviction policy | |

Exascale Computing Project | |

External trees | |

Extreme-Scales | |

F | |

FaaS | |

Factorized Sparse Approximate Inverse | |

Failure Characterization | |

Failure Distribution | |

Farewell | |

fault tolerance | |

Fault-aware resource allocation | |

Fault-tolerant Execution | |

FFT | |

Field Programmable Gate Array | |

Fine-grain Parallelism | |

fixed-parameter algorithm | |

Fog | |

frequency scaling | |

G | |

GAN | |

GASPI | |

Genetic Sequence Comparison | |

Ginkgo | |

Glasgow | |

GPU | |

GPU Computing | |

GPU preemption | |

GPU programming | |

GPU sharing | |

Graph AI | |

graph databases | |

graph problems | |

Graph Processing | |

Graphical DSL | |

Graphs | |

H | |

hardware accelerators | |

hardware performance counters | |

Hardware Transactional Memory | |

Heterogeneity | |

Heterogeneous architectures | |

Heterogeneous computing | |

Heterogeneous data processing | |

Heterogeneous platforms | |

heterogeneous voltage | |

hierarchical architectures | |

High performance clouds | |

High Performance computing | |

High performance computing (HPC) | |

High Performance Conjugate Gradient | |

high-dimensional | |

high-performance computing | |

High-performance Graph Computing | |

HPC | |

HPC Systems | |

HPCG | |

HPX | |

Hybrid parallel computing | |

I | |

IaaS | |

IaC | |

IC-PCP | |

Impact of node failures on MPI parallel jobs | |

Incomplete Sparse Approximate Inverse | |

Intel GPUs | |

Inter-GPU communication | |

Irregular application | |

J | |

Java parallel streams | |

K | |

Kernel Fusion | |

kernel perforation | |

Key-Value Store | |

Keyword 1 | |

Keyword 2 | |

Keyword 3 | |

Keyword1 | |

Keyword2 | |

Keyword3 | |

Knowledge distillation | |

Knowledge Graphs | |

Kubernetes | |

L | |

large scale graph data | |

Lattice Boltzmann method | |

Layer fusion | |

LCS | |

linear algebra | |

Linked lists | |

Lisbon | |

llvm | |

load balancing | |

Load Imbalance | |

Load-balancing | |

Lock-freedom | |

Longest Processing Time (LPT) heuristic | |

loop optimization | |

Lower Bound | |

LU | |

M | |

Machine Learning | |

Mahalanobis distance | |

makespan | |

Manycore Parallelism | |

manycore systems | |

Math library | |

matrix factorizations | |

Memory Contention | |

memory hierarchy | |

memory traffic prioritization | |

memory-aware algorithms | |

Memory-aware scheduling | |

METIS | |

microservice | |

Microservices | |

Min-max optimization | |

Min-sum optimization | |

Mixed Precision | |

MLOps | |

Mobile Computing | |

Model management pipeline | |

model parallelism | |

Molecular dynamics | |

MPI | |

Multi-agent systems | |

Multi-GPUs | |

multi-resource scheduling | |

multicore | |

Multicore CPUs | |

Multilevel Memory | |

Multiphysics | |

N | |

NEC | |

Network performance | |

Neural Network Partitioning | |

neural networks | |

Non-Interactive | |

Non-linear Optimization Problems | |

nonvolatile memory | |

Numerical Linear Algebra | |

O | |

offloading | |

oneAPI | |

online job scheduling | |

Online Scheduling | |

OpenCL | |

OpenMP | |

Optimization | |

Orthogonal resource | |

Overlapping communication and computations | |

P | |

parallel | |

Parallel Algorithms | |

Parallel Approximation Algorithms | |

parallel computing | |

Parallel Dynamic Programming | |

Parallel ILU preconditioner | |

parallel label propagation | |

Parallel machines | |

Parallel model training | |

parallel numerical methods | |

Parallel Programming | |

Parallel String Comparison | |

Parallel tool | |

Parallelism | |

Parallelization | |

Particle simulations | |

Particle-In-Cell (PIC) | |

Performance | |

Performance Analysis | |

Performance Modeling | |

Performance optimization | |

Performance portability | |

Performance scalability | |

Pipelining | |

Plasma Simulation | |

polyhedral model | |

Power regulation | |

Preconditioned Conjugate Gradient | |

Preconditioning | |

Priority queues | |

Privacy | |

process mapping | |

Profiling | |

Programming model | |

programming models | |

public procurement | |

Q | |

Quality of Service | |

R | |

RDMA | |

Real-Time Processing | |

Reconfigurable architectures | |

Reinforcement Learning | |

Reliability | |

rematerialization | |

Replica Selection | |

Replication | |

Reputation | |

resilience | |

Resource Management | |

Resource Optimization | |

Resource sharing | |

reuse distance | |

Rigid jobs | |

runtime system | |

Runtime systems | |

S | |

SABNAtk | |

Scalability | |

scheduling | |

Scheduling and optimization | |

Score-P | |

Sequence Alignment | |

Serverless | |

Serverless computing | |

Serverless trace | |

Serverless workload | |

Service level objective | |

Service mesh | |

Shared data-structures | |

Sigmoid model | |

simulated annealing | |

simulation | |

Skip lists | |

Smart Contract | |

smart contracts | |

Social media | |

Software Transactional Memory | |

Sparse computation | |

sparse format | |

Sparse Linear Algebra | |

Sparse linear systems | |

Sparse matrix matrix multiplication | |

Spatial prisoner’s dilemma game | |

SpMV | |

Spot GPU | |

Stochastic Levenberg-Marquardt | |

Stream | |

Stream Processing | |

String Comparison | |

subgraph isomorphism | |

Supercomputers | |

Supply chain management | |

SYCL | |

Synthetic node failure trace generation | |

T | |

Tail Latency | |

Task granularity | |

task graph parallelism | |

Task parallelism | |

Task Scheduling | |

Task-based Programming | |

task-based programming model | |

Task-Based Runtimes | |

Task-level parallelism | |

Tasking | |

Tasks sharing data | |

Telemetry | |

Tensor core | |

time-lock puzzle | |

tool | |

trace | |

TRSM | |

Trust | |

V | |

Vector computing | |

Verifiable Delay Function | |

Video Transcoding | |

virtual infrastructure planning | |

W | |

Weighted All-Substrings LCS | |

Weighted LCS | |

Welcome | |

workflow | |

workload | |

Workshare | |

Z | |

Zero Copy | |

Zero Copy API |