

With the advent of the big data era, the explosive growth of data volume has put enormous computing and storage pressure on electric power companies. As a powerful technical means, supercomputing clouds are widely used in data processing, storage, and online services. However, critical services in supercomputing clouds are often deployed with overprovisioned resources to ensure the quality of service for users, resulting in significant energy consumption and additional costs. At the same time, insufficient resources for service provisioning can lead to performance degradation and service violations. To this end, this paper proposes an automatic resource scaling management system on electric power supercomputing clouds. Specifically, the proposed system is based on Transformer’s long-sequence prediction model to predict the future load intensity of the service and calculate the number of instances required by the service in the future through the runtime service requirement estimation component, thus automatically scaling resources and minimizing resource costs. Experimental results show that our system achieves the best scaling behavior based on performance metrics and the lowest cost overhead compared to strong baselines.