Quantitative Performance Assessment of Distributed Machine Learning Frameworks for Real-Time Financial Analytics in Enterprise Data Platforms

MD Fardous; Md. Morshedul Islam

doi:10.63125/ky027n68

Authors

MD Fardous MS in Information Technology (Data Management & Analytics); Washington University of Science and Technology, USA Author
Md. Morshedul Islam MS in Information Technology (Data Management & Analytics); Washington University of Science and Technology, USA Author

DOI:

https://doi.org/10.63125/ky027n68

Keywords:

Distributed Machine Learning, Financial Analytics, Enterprise Data Platforms, Performance Benchmarking, Real-Time Analytics

Abstract

The increasing volume and velocity of financial data generated in modern enterprise environments have created significant demand for scalable analytical infrastructures capable of supporting real-time financial analytics. Distributed machine learning frameworks have emerged as essential technologies for processing large-scale financial datasets across cluster-based computing environments. This study conducted a quantitative performance assessment of distributed machine learning frameworks used in enterprise data platforms for real-time financial analytics. An experimental benchmarking design was implemented to evaluate the computational performance of three distributed machine learning frameworks operating on enterprise-scale financial datasets. The experimental dataset consisted of 120 performance observations obtained from repeated analytical workloads executed across a distributed cluster infrastructure. Key performance indicators examined in the study included computational throughput, model training duration, analytical response latency, and system resource utilization. The results demonstrated measurable differences in performance across the evaluated frameworks. Framework A achieved the highest average computational throughput of 14.6 GB per minute, while Framework B and Framework C processed financial datasets at average throughputs of 12.3 GB and 11.5 GB per minute respectively. Model training efficiency also varied across frameworks, with Framework A completing distributed training tasks in an average of 34.2 minutes compared with 39.8 minutes for Framework B and 41.7 minutes for Framework C. Real-time analytical response latency averaged 198 milliseconds for Framework A, while Framework B and Framework C recorded response delays of 221 milliseconds and 232 milliseconds respectively. Scalability testing further indicated that throughput performance improved as cluster capacity increased from 4 nodes to 16 nodes, with Framework A achieving a maximum throughput of 18.4 GB per minute under expanded cluster configurations. Statistical analysis using analysis of variance confirmed that the observed performance differences were statistically significant at the p < 0.05 level. Effect size analysis also indicated moderate to large differences across the evaluated performance metrics. The findings demonstrate that distributed machine learning frameworks differ substantially in their ability to support enterprise financial analytics workloads. These results provide empirical evidence supporting the importance of framework architecture, distributed resource coordination, and scalability capabilities in determining the efficiency of large-scale financial analytics systems operating within enterprise data platforms.