Comprehensive Review on the Intersection of Big Data Analytics and Machine Learning in the Era of Generative AI

Authors

  • Manish Adawadkar Independent Researcher, Kelley School of Business, Indiana University, Bloomington, USA

Keywords:

Big Data Analytics, Machine Learning, Generative AI, Distributed Computing, LLMs, Data Augmentation, GANs, Cloud Computing

Abstract

The accelerated growth of generative AI has transformed the relationship between Big Data Analytics (BDA) and Machine Learning (ML), enabling intelligent data-driven systems with unprecedented scalability, automation, and representation learning capability. This paper presents a comprehensive review of how BDA pipelines integrate with classical machine learning, deep learning, and modern generative AI systems such as Generative Adversarial Networks (GANs) and large language models (LLMs). A detailed examination of system modules—including data ingestion, distributed storage, feature engineering, model training, generative augmentation, and deployment—is presented to understand their role in modern analytics ecosystems. Although the paper discusses generative AI trends, all literature references are restricted to work published before December 2023. The proposed system architecture demonstrates how organizations can combine big data infrastructure with generative AI-driven ML pipelines to enhance decision-making, synthetic data generation, automation, and enterprise intelligence.

Downloads

Published

2024-01-30

How to Cite

Comprehensive Review on the Intersection of Big Data Analytics and Machine Learning in the Era of Generative AI. (2024). American Journal of Engineering , Mechanics and Architecture (2993-2637), 2(1), 98-108. https://grnjournal.us/index.php/AJEMA/article/view/2891