Scope and Topics of Interest: Processing large datasets to extract information and knowledge has always been a fundamental problem. Today this problem is further exacerbated, as the data a researcher or a company needs to cope with can be immense in volume, distributed in location, and unstructured in format. Recent advances in computer hardware and storage technologies have allowed us to gather, store, and analyze such large-scale data. However, without scalable and cost-effective algorithms that use these resources efficiently, neither the resources nor the data itself can serve science and society to their full potential.<br>Analyzing Big Data requires a vast amount of storage and computing resources. We need to untangle the large, puzzling body of information we have, and while doing so we need to be fast and robust: the information we need may be crucial in a life-or-death situation. We need to be accurate: a single misleading piece of information extracted from the data can cause an avalanche effect. Each problem has its own characteristics and priorities. Hence, the best combination of algorithm and architecture differs from one application to another.<br>This workshop aims to bring together people who work on data-intensive and high performance computing in industry, research labs, and academia to share the problems posed by Big Data in various application domains and the knowledge required to solve them.<br>All novel data-intensive computing techniques, data storage and integration schemes, and algorithms for cutting-edge high performance computing architectures that target the utilization of Big Data are of interest to the workshop.
Topics include, but are not limited to:<br>– parallel algorithms for data-intensive applications,<br>– scalable data and text mining and information retrieval,<br>– using Hadoop, MapReduce, Spark, Storm, Streaming to analyze Big Data,<br>– energy-efficient data-intensive computing,<br>– deep learning with massive-scale datasets,<br>– querying and visualization of large network datasets,<br>– processing large-scale datasets on clusters of multicore and manycore processors and accelerators,<br>– heterogeneous computing for Big Data architectures,<br>– Big Data in the Cloud,<br>– processing and analyzing high-resolution images using high-performance computing,<br>– using hybrid infrastructures for Big Data analysis.<br>
Abbreviation
HPC4BD
City
Philadelphia, PA
Country
United States
Deadline Paper
Start Date
End Date
Abstract