Days: Sunday, May 14th Monday, May 15th Tuesday, May 16th Wednesday, May 17th
View this program: with abstractssession overviewtalk overview
08:30 | CCGRID-LIFE Welcome ( abstract ) |
09:00 | Medical Imaging Processing on a Big Data platform using Python: Experiences with Heterogeneous and Homogeneous Architectures ( abstract ) |
09:30 | Analog-Digital Approach in Human Brain Modeling ( abstract ) |
10:00 | Biopet: towards Scalable, Maintainable, User-friendly, Robust and Flexible NGS data analysis pipelines ( abstract ) |
08:30 | Apply Block Index Technique to Scientific Data Analysis and I/O Systems ( abstract ) |
09:00 | Smart RDF Data storage in Graph Databases ( abstract ) |
09:30 | A Level-Wise Load Balanced Scientific Workflow Execution Optimization using NSGA-II ( abstract ) |
10:00 | RAPID: A Fast Data Update Protocol in Erasure Coded Storage Systems for Big Data ( abstract ) |
08:30 | Cost Model And Analysis of Iterative MapReduce Applications for Hybrid Cloud Bursting ( abstract ) |
09:00 | Cloud Resource Scaling for Big Data Streaming Applications Using A Layered Multi-dimensional Hidden Markov Model ( abstract ) |
09:30 | Scheduling Data Stream Jobs on Distributed Systems with Background Load ( abstract ) |
11:00 | Using the Cloud for parameter estimation problems: comparing Spark vs MPI with a case-study ( abstract ) |
11:30 | Fine-grained Supervision and Restriction of Biomedical Applications in Linux Containers ( abstract ) |
12:00 | CCGRID-LIFE Panel discussion ( abstract ) |
11:00 | MapReduce-based Algorithms for Managing Big RDF Graphs: State-of-the-Art Analysis, Paradigms, and Future Directions ( abstract ) |
11:30 | Optimized MapFile based Storage of Small Files in Hadoop ( abstract ) |
12:00 | An Accuracy-Aware Implementation of Two-Point Three-Dimensional Correlation Function using Bin-Recycling Strategy on GPU ( abstract ) |
12:30 | A Big Data Architecture for Automotive Applications: PSA Group Deployment Experience ( abstract ) |
11:00 | Defining Intercloud Security Framework and Architecture Components for Multi-Cloud Data Intensive Applications ( abstract ) |
11:30 | Improving Resource Efficiency of Container-instance Clusters on Clouds ( abstract ) |
12:00 | Hybrid Mobile Edge Computing: Unleashing the Full Potential of Edge Computing in Mobile Device Use Cases ( abstract ) |
12:30 | Multi-Objective Particle Swarm Optimization for VM Placement(MOPSO-VMP) in Multi-Cloud ( abstract ) |
Juan Antonio Rico Gallego (University of Extremadura, Spain)
14:00 | Analyzing the Parallel I/O Severity of MPI Applications ( abstract ) |
14:30 | Formal modeling and performance evaluation of a run-time rank remapping technique in Broadcast, Allgather and Allreduce MPI collective operations ( abstract ) |
15:00 | Reducing Load Imbalance of Virtual Clusters via Reconfiguration and Adaptive Job Scheduling ( abstract ) |
15:30 | Performance Models for Communication in Collective I/O Operations ( abstract ) |
Anna Queralt (Barcelona Supercomputing Center, Spain)
14:00 | Task-based programming model as an alternative for Big Data and Analytics ( abstract ) |
15:00 | A Data-driven Approach based on Auto-Regressive Models for Energy-Efficient Clouds ( abstract ) |
15:30 | An Empirical Evaluation of How The Network Impacts The Performance and Energy Efficiency in RAMCloud ( abstract ) |
Roy Campbell (University of Illinois at Urbana-Champaign, USA)
Wagner Meira Jr. (Universidade Federal do Minas Gerais, Brazil)
14:00 | US Air Force Interests and Directions in Cyber Security ( abstract ) |
14:30 | IT Security and Privacy Standards in Comparison Improving FedRAMP Authorization for Cloud Service Providers ( abstract ) |
14:50 | A Game-Theoretic Approach for Runtime Capacity Allocation in MapReduce ( abstract ) |
15:10 | Automatic Consolidation of Virtual Machines in On-Premises Cloud Platforms ( abstract ) |
15:30 | PRIVAaaS: privacy approach for a distributed cloud-based data analytics platforms ( abstract ) |
Juan Antonio Rico Gallego (University of Extremadura, Spain)
16:30 | Automatic Adaption of the Sampling Frequency for Detailed Performance Analysis ( abstract ) |
17:00 | Empirical Mode Decomposition for Modeling of Parallel Applications on Intel Xeon Phi Processors ( abstract ) |
17:30 | Energy model for low-power cluster ( abstract ) |
18:00 | Near-optimal policies for energy-aware task assignment in server farms ( abstract ) |
Anna Queralt (Barcelona Supercomputing Center, Spain)
16:30 | Evaluation of HPC-Big Data Applications Using Cloud Platforms ( abstract ) |
17:00 | Exploring Shared State in Key-Value Store for Window-Based Multi-Pattern Streaming Analytics ( abstract ) |
17:30 | On the Use of In-Memory Analytics Workflows to Compute eScience Indicators from Large Climate Datasets ( abstract ) |
Roy Campbell (University of Illinois at Urbana-Champaign, USA)
Wagner Meira Jr. (Universidade Federal do Minas Gerais, Brazil)
16:30 | Assured Cloud Computing: are we there yet? ( abstract ) |
17:00 | A lightweight MapReduce framework for secure processing with SGX ( abstract ) |
17:30 | Evaluating the performance of continuous test-based cloud service certification ( abstract ) |
18:00 | A game theoretic method for VM-to-hypervisor attacks detection in cloud environment ( abstract ) |
View this program: with abstractssession overviewtalk overview
Jesus Carretero (Universidad Carlos III de Madrid, Spain)
Javier Garcia Blas (Carlos III University, Spain)
Manish Parashar (State University of New Jersey University, USA)
Abstract
The adoption of machine learning is proving to be an amazingly successful strategy in improving predictive models for cancer and infectious disease. In this talk I will discuss two projects my group is working on to advance biomedical research through the use of machine learning and HPC. In cancer, machine learning and in deep learning in particular, is used to advance our ability to diagnosis and classify tumors. Recently demonstrated automated systems are routinely out performing human expertise. Deep learning is also being used to predict patient response to cancer treatments and to screen for new anti-cancer compounds. In basic cancer research its being use to supervise large-scale multi-resolution molecular dynamics simulations used to explore cancer gene signaling pathways. In public health it’s being used to interpret millions of medical records to identify optimal treatment strategies. In infectious disease research machine learning methods are being used to predict antibiotic resistance and to identify novel antibiotic resistance mechanisms that might be present. More generally machine learning is emerging as a general tool to augment and extend mechanistic models in biology and many other fields. It’s becoming an important component of scientific workloads. From a computational architecture standpoint, deep neural network (DNN) based scientific applications have some unique requirements. They require high compute density to support matrix-matrix and matrix-vector operations, but they rarely require 64bit or even 32bits of precision, thus architects are creating new instructions and new design points to accelerate training. Most current DNNs rely on dense fully connected networks and convolutional networks and thus are reasonably matched to current HPC accelerators. However future DNNs may rely less on dense communication patterns. Like simulation codes power efficient DNNs require high-bandwidth memory be physically close to arithmetic units to reduce costs of data motion and a high-bandwidth communication fabric between (perhaps modest scale) groups of processors to support network model parallelism. DNNs in general do not have good strong scaling behavior, so to fully exploit large-scale parallelism they rely on a combination of model, data and search parallelism. Deep learning problems also require large-quantities of training data to be made available or generated at each node, thus providing opportunities for NVRAM. Discovering optimal deep learning models often involves a large-scale search of hyperparameters. It’s not uncommon to search a space of tens of thousands of model configurations. Naïve searches are outperformed by various intelligent searching strategies, including new approaches that use generative neural networks to manage the search space. HPC architectures that can support these large-scale intelligent search methods as well as efficient model training are needed.
09:30 | A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling ( abstract ) |
09:55 | 4CeeD: Real-Time Data Acquisition and Analysis Framework for Material-related Cyber-Physical Environments ( abstract ) |
11:00 | Data Locality Strategies for Iterative MapReduce Applications in Hybrid Clouds ( abstract ) |
11:30 | Leveraging Renewable Energy in Edge Clouds for Data Stream Analysis in IoT ( abstract ) |
12:00 | DOTA: Delay Bounded Optimal Cloudlet Deployment and User Association in WMANs ( abstract ) |
11:00 | Mitigating YARN Container Overhead with Input Splits ( abstract ) |
11:30 | Parallel Variable Selection for Effective Performance Prediction ( abstract ) |
12:00 | Towards Big Data Analytics across Multiple Clusters ( abstract ) |
11:00 | CloudSight: A tenant-oriented transparency framework for cross-layer cloud troubleshooting ( abstract ) |
11:30 | CBase: A New Paradigm for Fast Virtual Machine Migration across Data Centers ( abstract ) |
12:00 | Deploying High Throughput Scientific Workflows on Container Schedulers with Makeflow and Mesos ( abstract ) |
Michela Täufer (University of Delaware, USA)
11:00 | Massively Parallel Simulations of Spread of Infectious Diseases over Realistic Social Networks ( abstract ) |
11:20 | Scaling HDFS to more than 1 million operations per second with HopsFS ( abstract ) |
11:40 | Scaling a Convolutional Neural Network for classification of Adjective Noun Pairs with TensorFlow on GPU Clusters ( abstract ) |
12:00 | mD3DOCKxb: An Ultra-Scalable CPU-MIC Coordinated Virtual Screening Framework ( abstract ) |
12:20 | Scalable Assembly for Massive Genomic Graphs ( abstract ) |
14:00 | APHiD: Hierarchical task placement to enable a tapered fat tree topology for lower power and cost in HPC networks ( abstract ) |
14:25 | Swift-X: Accelerating OpenStack Swift with RDMA for Building an Efficient HPC Cloud ( abstract ) |
14:50 | Offloading communication control logic in GPU accelerated applications ( abstract ) |
15:15 | Preliminary Performance Analysis of Multi-rail Fat-tree Networks ( abstract ) |
15:30 | SynAPTIC: Secure And Persistent connecTIvity for Containers ( abstract ) |
14:00 | Optimal Resource Configuration of Complex Services in the Cloud ( abstract ) |
14:25 | Modeling Distributed Platforms from Application Traces for Realistic File Transfer Simulation ( abstract ) |
14:50 | Performance Modelling and Cost Effective Execution for Distributed Graph Processing on Configurable VMs ( abstract ) |
15:15 | Fine-grained Nested Virtual Machine Performance Analysis Through First Level Hypervisor Tracing ( abstract ) |
14:00 | A new on-line method for scheduling independent tasks ( abstract ) |
14:30 | A Two-Stage Multi-Objective Optimization of Erasure Coding in Overlay Networks ( abstract ) |
15:00 | Multi-dimensional admission control and capacity planning for IaaS clouds with multiple service classes ( abstract ) |
15:30 | A Robust Tabu Search Heuristic for VM Consolidation under Demand Uncertainity in Virtualized Datacenters ( abstract ) |
16:30 | Efficient Event Correlation over Distributed Systems ( abstract ) |
17:00 | Efficient Cache Update for In-Memory Cluster Computing with Spark ( abstract ) |
17:30 | GPU in-memory processing using Spark for iterative computation ( abstract ) |
16:30 | Preemptive Software Transactional Memory ( abstract ) |
17:00 | Supporting Fault-Tolerance in Presence of In-Situ Analytics ( abstract ) |
17:30 | Advanced Thread Synchronization for Multithreaded MPI Implementations ( abstract ) |
16:30 | Pattern-Directed Replication Scheme for Heterogeneous Object-based Storage ( abstract ) |
17:00 | A New File System I/O Mode for Efficient User-level Caching ( abstract ) |
17:30 | COPS: Cost Based Object Placement Strategies on Hybrid Storage System for DBaaS Cloud ( abstract ) |
19:00 | Using the Jetstream Research Cloud to provide Science Gateway resources ( abstract ) |
19:00 | Lemonade: A scalable and efficient Spark-based platform for data analytics ( abstract ) |
19:00 | Massive Data Load on Distributed Database Systems over HBase ( abstract ) |
19:00 | Techniques for Handling Error in User-estimated Execution Times During Resource Management on Systems Processing MapReduce Jobs ( abstract ) |
19:00 | Privacy Preserving Data Outsourcing via Homomorphic Secret Splitting Schemes ( abstract ) |
19:00 | Representing Variant Calling Format as Directed Acyclic Graphs to enable the use of cloud computing for efficient and cost effective genome analysis ( abstract ) |
19:00 | IBM Research Hybrid Cloud ( abstract ) |
19:00 | EffiEye: Application-aware Large Flow Detection in Data Center ( abstract ) |
19:00 | Load and Video Performance Patterns of a Cloud Based WebRTC Architecture ( abstract ) |
19:00 | Extending Message Passing Interface Windows to Storage ( abstract ) |
19:00 | Performance Optimization by Dynamicly Altering Cache Replacement Algorithm in CPU-GPU Heterogeneous Multi-Core Architecture ( abstract ) |
19:00 | Multi-agent recommendation system in Internet of Things ( abstract ) |
19:00 | BBQ: Elastic MapReduce over Cloud Platforms ( abstract ) |
19:00 | TuNao: A High-Performance and Energy-Efficient Reconfigurable Accelerator for Graph Processing ( abstract ) |
19:00 | Mermaid: Integrating Vertex-Centric with Edge-Centric for Real-World Graph Processing ( abstract ) |
19:00 | A Live Demo for Showing the Benefits of Applying the Remote GPU Virtualization Technique to Cloud Computing ( abstract ) |
19:00 | AURA: Recovering from Transient Failures in Cloud Deployments ( abstract ) |
19:00 | DSA: Scalable Distributed Sequence Alignment System Using SIMD Instructions ( abstract ) |
View this program: with abstractssession overviewtalk overview
Abstract
After a brief review on HPC research and development under China’s high-tech R&D program in the past years, this talk will introduce the new key project on high performance computing in the national key R&D Program of China in the 13th 5-year plan. The major challenges and technical issues in developing the exa-scale system will be discussed. The goal and the major activities, as well as the current status, of the new key project will be presented.
09:30 | Combating the Bandits in the Cloud: A Moving Target Defense Approach ( abstract ) |
09:55 | Chrysaor: Fine-Grained, Fault-Tolerant Cloud-of-Clouds MapReduce ( abstract ) |
11:00 | Unveiling the Interplay Between Global Link Arrangements and Network Management Algorithms on Dragonfly Networks ( abstract ) |
11:20 | Application-Agnostic Power Monitoring in Virtualized Environments ( abstract ) |
11:40 | A performance study of UCX over InfiniBand ( abstract ) |
12:00 | Performance Modelling and Verification of Cloud-based Auto-Scaling Policies ( abstract ) |
11:00 | Dynamic Management of In-memory Storage for Efficiently Integrating Compute- and Data-intensive Computing on HPC Systems ( abstract ) |
11:30 | Energy-efficient I/O Thread Schedulers for NVMe SSDs on NUMA ( abstract ) |
12:00 | Enabling Distributed Software-Defined Environments Using Dynamic Infrastructure Service Composition ( abstract ) |
11:00 | Interconnect Your Future: Paving the Road to Exascale ( abstract ) |
11:45 | Intel Parallel Studio XE - a powerful set of compilers and tools (not only) for HPC and a short outlook to Intel's ideas of AI ( abstract ) |
11:00 | Data-Aware Support for Hybrid HPC and Big Data Applications ( abstract ) |
11:20 | Secure cloud storage service for detection of security violations ( abstract ) |
11:40 | Optimization of checkpoints and execution model for an implementation of OpenMP on distributed memory architectures ( abstract ) |
12:00 | Dynamic Resource Management Across Cloud-Edge Resources for Performance-Sensitive Applications ( abstract ) |
14:00 | Joint Optimization of Scaling and Placement of Virtual Network Services ( abstract ) |
14:20 | Acyclic Partitioning of Large-Scale Directed Acyclic Graphs ( abstract ) |
14:40 | Towards Energy Budget Control in HPC ( abstract ) |
15:00 | On Estimating Minimum Bids for Amazon EC2 Spot Instances ( abstract ) |
15:20 | Practical Service Placement Approach for Microservices Architecture ( abstract ) |
14:00 | Security Implications of Blockchain Cloud with Analysis of Block Withholding Attack ( abstract ) |
14:25 | ProvChain: A Blockchain-based Data Provenance Architecture in Cloud Environment with Enhanced Privacy and Availability ( abstract ) |
14:50 | T-VMI: Trusted Virtual Machine Introspection in Cloud Environments ( abstract ) |
15:15 | Modeling correlation between QoS attributes for trust computation in cloud computing environments ( abstract ) |
15:40 | Crowdsourced Data Integrity Verification for Key-Value Stores in the Cloud ( abstract ) |
16:30 | An Anomaly Detection Fabric for Clouds Based on Collaborative VM Communities ( abstract ) |
16:55 | LOGAIDER: A tool for mining potential correlations of HPC log events ( abstract ) |
17:20 | Designing and Modelling Selective Replication for Fault-tolerant HPC Applications ( abstract ) |
16:30 | High-Performance Key-Value Store On OpenSHMEM ( abstract ) |
16:50 | AnalyzeThat: A Programmable Shared-Memory System for an Array of Processing-In-Memory Devices ( abstract ) |
17:10 | Implementation and Evaluation of One-sided PGAS Communication in XcalableACC for Accelerated Clusters ( abstract ) |
17:30 | Combining Both a Component Model and a Task-based Model for HPC Applications: a Feasibility Study on GYSELA ( abstract ) |
16:30 | Towards Distributed Software-Defined Environments ( abstract ) |
17:00 | Cable-geometric error-prone approach for low-latency interconnection networks ( abstract ) |
17:30 | Enhancing the rCUDA Remote GPU Virtualization Framework: from a Prototype to a Production Solution ( abstract ) |
View this program: with abstractssession overviewtalk overview
Abstract
The algorithms underlying numerical weather prediction (NWP) and climate models that have been developed in the past few decades face an increasing challenge caused by the paradigm shift imposed by hardware vendors towards more energy-efficient devices. In order to provide a sustainable path to Exascale High Performance Computing (HPC), applications become increasingly restricted by energy consumption. As a result, the emerging diverse and complex hardware solutions have a large impact on the programming models traditionally used in NWP software, triggering a rethink of design choices for future massively parallel software frameworks. To this end, the ECMWF is leading the ESCAPE project, a European funded project involving regional NWP centres, Universities, HPC centres and hardware vendors. The aim is to combine interdisciplinary expertise for defining and co-designing the necessary steps towards affordable, Exascale high-performance simulations of weather and climate.
10:30 | Adaptive Hybrid Queue Configuration for Supercomputer Systems ( abstract ) |
11:00 | Flexible Scheduling of Distributed Analytic Applications ( abstract ) |
11:30 | CtrlCloud: Performance-Aware Adaptive Control for Shared Resources in Clouds ( abstract ) |
12:00 | QoS-Aware Virtual Infrastructures Allocation on SDN-based Clouds ( abstract ) |
10:30 | Maximum Sustainable Throughput Prediction for Data Stream Processing over Public Clouds ( abstract ) |
10:55 | WattsKit: Software-Defined Power Monitoring of Distributed Systems ( abstract ) |
11:20 | Predicting cloud performance for HPC applications: a user-oriented approach ( abstract ) |
11:45 | An Approach and Case Study of Cloud Instance Type Selection for Multi-Tier Web Applications ( abstract ) |
12:10 | ENOS: a Holistic Framework for Conducting Scientific Evaluations of OpenStack ( abstract ) |
10:30 | Energy Efficient Algorithm for VNF Placement and Chaining ( abstract ) |
11:00 | KPI-agnostic Control for Fine-Grained Vertical Elasticity ( abstract ) |
11:30 | PCSsampler: Sample based, Private-state Cluster Scheduling ( abstract ) |
12:00 | Optimized Cloud Deployment of Multi-tenant Software Considering Data Protection Concerns ( abstract ) |
Javier Garcia Blas (Carlos III University, Spain)
Manish Parashar (State University of New Jersey University, USA)