Wednesday, March 22, 2023
SCIENMAG: Latest Science and Health News
No Result
View All Result
  • Login
  • HOME PAGE
  • BIOLOGY
  • CHEMISTRY AND PHYSICS
  • MEDICINE
    • Cancer
    • Infectious Emerging Diseases
  • SPACE
  • TECHNOLOGY
  • CONTACT US
  • HOME PAGE
  • BIOLOGY
  • CHEMISTRY AND PHYSICS
  • MEDICINE
    • Cancer
    • Infectious Emerging Diseases
  • SPACE
  • TECHNOLOGY
  • CONTACT US
No Result
View All Result
Scienmag - Latest science news from science magazine
No Result
View All Result
Home Latest News

A new approach to automatically tune the resource configurations for streaming data processing systems using machine learning

January 10, 2023
in Latest News
0
Share on FacebookShare on Twitter

Data can be likened to a stream of water when a large amount of data is generated continuously. A variety of data including applications, networked devices, server log files, various online activities, and location-based data can form a continuous stream. We call such a form of data processing stream data. In streaming data, various types of data sources can be collected, managed, stored, analyzed in real time and provided with information. For most scenarios where dynamic new data is continuously generated, it is beneficial to adopt streaming data processing, which is suitable for most industries and big data use cases.

Stream data processing systems are used to analyze stream data. There are already many stream data processing systems that are widely used by companies, such as Apache Flink, Apache Storm, Spark Streaming, and Apache Heron. These stream data processing applications are characterized by large deployments and long run times (months or even years) in applications, and each application runs with different data, so even small performance improvements can have significant financial benefits for companies. To improve system performance, resource configuration parameters need to be tuned to specify the amount of resources such as CPU cores and memory used in tasks. But selecting key configuration parameters and finding their optimal values for stream data processing applications is very challenging, and manually tuning these parameters is extremely time-consuming. For a single unknown application, a performance engineer, who has a deep understanding on the stream data processing system, may take several days or even weeks to find its optimal resource configuration.

In order to solve the above problem, researchers have started to apply machine learning methods to conduct research. A study was published in Intelligent Computing on Oct. 6. The authors used the Apache Flink program as an experimental stream data processing application. The machine learning approach was used to automatically and efficiently tune the resource allocation parameters for the stream data processing application. It applies a Random Forest algorithm to build a highly accurate performance model for a stream data processing program that outputs the tail latency or throughput of the application, taking the speed of input data and key configuration parameters as input. In addition, the machine learning approach leverage the Bayesian optimization algorithm (BOA) to iteratively search the high-dimensional resource configuration space to achieve optimal performance.

This approach has been experimentally shown to significantly improve the 99th-percentile tail latency and throughput. The method proposed in this study is a parameter-tuning tool independent of the Flink system, and can be integrated into other stream processing systems, such as Spark Streaming and Apache Storm.



DOI

10.34133/2022/9820424

Article Title

Resource Configuration Tuning for Stream Data Processing Systems via Bayesian Optimization

Tags: approachautomaticallyconfigurationsdatalearningmachineprocessingresourcestreamingsystemstune
Share25Tweet16Share4ShareSendShare
  • Bacterial communities in the penile urethra

    Healthy men who have vaginal sex have a distinct urethral microbiome

    94 shares
    Share 38 Tweet 24
  • Spotted lanternfly spreads by hitching a ride with humans

    87 shares
    Share 35 Tweet 22
  • Small but mighty: new superconducting amplifiers deliver high performance at lower power consumption

    83 shares
    Share 33 Tweet 21
  • Cyprus’s copper deposits created one of the most important trade hubs in the Bronze Age

    86 shares
    Share 34 Tweet 22
  • Researchers highlight nucleolar DNA damage response in fight against cancer

    72 shares
    Share 29 Tweet 18
  • Promoting healthy longevity should start young: pregnancy complications lift women’s risk of mortality in the next 50 years

    66 shares
    Share 26 Tweet 17
ADVERTISEMENT

About us

We bring you the latest science news from best research centers and universities around the world. Check our website.

Latest NEWS

Healthy men who have vaginal sex have a distinct urethral microbiome

Spotted lanternfly spreads by hitching a ride with humans

Artificial pancreas developed at UVA improves blood sugar control for kids ages 2-6, study finds

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Join 205 other subscribers

© 2023 Scienmag- Science Magazine: Latest Science News.

No Result
View All Result
  • HOME PAGE
  • BIOLOGY
  • CHEMISTRY AND PHYSICS
  • MEDICINE
    • Cancer
    • Infectious Emerging Diseases
  • SPACE
  • TECHNOLOGY
  • CONTACT US

© 2023 Scienmag- Science Magazine: Latest Science News.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In