To solve the problems of feature extraction, data dimensionality reduction, gradient dissipation, low training efficiency, and poor real-time performance of CNN in fault classification and diagnosis.