Federated learning has emerged as a transformative paradigm for collaborative model training across decentralized and privacy-sensitive data sources. By enabling multiple clients to jointly learn a global model without direct data sharing, federated learning addresses critical concerns related to data privacy, regulatory compliance, and communication efficiency. However, as real-world deployments expand across heterogeneous devices, applications, and user populations, fundamental challenges have become increasingly evident. Chief among these challenges is the presence of statistical heterogeneity, commonly referred to as non-identically and independently distributed data, which undermines the effectiveness of traditional federated optimization strategies such as Federated Averaging. This article presents an extensive and theoretically grounded exploration of personalized and cluster-aware federated learning as a response to these limitations. Drawing strictly on established literature, the paper synthesizes advances in soft and hard clustering, personalized optimization objectives, representation learning, curriculum strategies, and meta-learning approaches within federated settings. The methodology emphasizes a conceptual and comparative analysis of algorithmic frameworks rather than empirical experimentation, allowing for a deep examination of underlying assumptions, convergence behaviors, fairness implications, and trade-offs between global generalization and local adaptation. The results are presented as a descriptive synthesis of findings reported across prior studies, highlighting consistent patterns such as improved local performance, robustness to heterogeneity, and enhanced user-level fairness when personalization mechanisms are introduced. The discussion critically examines unresolved issues, including scalability, interpretability, privacy leakage risks, and regulatory considerations, while also outlining promising directions for future research. By unifying diverse strands of federated learning research into a coherent analytical narrative, this article aims to provide a comprehensive reference for researchers and practitioners seeking to design federated systems that are both privacy-preserving and adaptive to real-world data diversity.