Core Concepts
Large language models (LLMs) can be effectively applied to bearing fault diagnosis, enhancing generalization across diverse operating conditions, limited sample sizes, and different bearing datasets.
Abstract
Bibliographic Information:
Tao, L., Liu, H., Ning, G., Cao, W., Huang, B., & Lu, C. (n.d.). LLM-based Framework for Bearing Fault Diagnosis.
Research Objective:
This paper investigates the potential of large language models (LLMs) for improving the generalization capabilities of bearing fault diagnosis systems, addressing challenges related to cross-condition adaptability, small-sample learning, and cross-dataset generalization.
Methodology:
The authors propose two novel LLM-based frameworks for bearing fault diagnosis: a feature-based approach and a data-based approach.
Feature-based approach:
- Feature Extraction: Time-domain and frequency-domain features are extracted from raw vibration signals.
- Textualization: Extracted features are converted into a textual format understandable by LLMs.
- Fine-tuning: A pre-trained LLM (ChatGLM2-6B-chat) is fine-tuned using the textualized features and corresponding fault labels. LoRA and QLoRA techniques are employed for efficient fine-tuning.
Data-based approach:
- Patching: Vibration signals are segmented into patches to reduce redundancy and focus on local patterns.
- Embedding: Patches are converted into LLM input dimensions using value and position embeddings.
- Fine-tuning: A pre-trained LLM (GPT-2) is fine-tuned using the embedded data, with frozen attention and FFN layers to leverage pre-trained knowledge. Instance normalization and learnable affine transformations are applied for better knowledge transfer.
The performance of both frameworks is evaluated on four public bearing fault diagnosis datasets: CWRU, MFPT, JNU, and PU. Experiments include single-dataset, single-dataset cross-condition, complete-data cross-dataset, and limited-data cross-dataset scenarios.
Key Findings:
- Both feature-based and data-based LLM frameworks demonstrate promising results in bearing fault diagnosis.
- The proposed methods exhibit strong generalization capabilities, effectively handling cross-condition, small-sample, and cross-dataset scenarios.
- Multi-dataset training further enhances the knowledge transfer and generalization ability of the LLM-based models.
- Fine-tuning with limited data from a new dataset, leveraging knowledge from previous datasets, shows significant improvement compared to training solely on the limited data.
Main Conclusions:
This study highlights the potential of LLMs for advancing bearing fault diagnosis by overcoming limitations of traditional methods in terms of generalization. The proposed frameworks provide a novel and effective approach for accurate and adaptable fault diagnosis in complex real-world applications.
Significance:
This research contributes to the growing field of applying LLMs to time-series analysis and specifically addresses the critical challenge of bearing fault diagnosis in industrial settings. The findings have significant implications for improving the reliability, safety, and efficiency of rotating machinery maintenance.
Limitations and Future Research:
- The study primarily focuses on four specific bearing datasets. Further validation on a wider range of datasets is necessary to confirm the generalizability of the proposed methods.
- Exploring the integration of other advanced techniques, such as meta-learning and few-shot learning, with the LLM-based frameworks could further enhance their performance in data-scarce scenarios.
- Investigating the interpretability of LLM-based fault diagnosis models is crucial for gaining insights into the decision-making process and building trust in their predictions.
Stats
The CWRU dataset includes faults with depths of 0.007 inches, 0.014 inches, and 0.021 inches.
The MFPT dataset includes data from three normal bearings, 3+7 outer race fault bearings, and seven inner race fault bearings.
The JNU dataset has a sampling frequency of 50 kHz and includes three rotational speeds: 600 rpm, 800 rpm, and 1000 rpm.
The PU dataset uses data from 12 artificially damaged bearings and six normal bearings, with a sampling frequency of 64 kHz.
For the feature-based LLM, 12 time-domain and 12 frequency-domain features were selected.
For the data-based LLM, a patch size of 128 and a stride of 8 were chosen for a balance of accuracy and training time.
In the limited data transfer experiment, the model was fine-tuned with 10% of the new dataset.