Mel-RoFormer: A Versatile Deep Neural Network for Vocal Separation and Vocal Melody Transcription
Mel-RoFormer, a spectrogram-based model with a novel Mel-band Projection module and interleaved RoPE Transformers, achieves state-of-the-art performance in vocal separation and vocal melody transcription tasks.