Help with audio classification model
Hello, I hope this is the right place to post this.
I am working on a task where I have to train some models to classify audio as either real or AI-generated. I am using precomputed MFCC features. The issue is that the models seem to be taking some sort of shortcut and are getting ridiculously good performance metrics. Could anyone identify something specific to AI-generated songs that the models could be using as a shortcut? And how could I solve this?
I am desperate
Edit: I am using classical ML models (e.g., random forest).
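One way to look for such a shortcut is to check whether any single MFCC coefficient separates the two classes almost on its own. Below is a minimal, dependency-free sketch of that check. Everything in it is an assumption for illustration: the function names (`best_stump_accuracy`, `rank_features`, `make_synthetic`) are hypothetical, and the data is synthetic, with class 1 deliberately shifted in feature 0 to imitate a shortcut (for example, a consistent loudness difference, since the 0th MFCC coefficient largely tracks overall energy). It is not the actual dataset or pipeline.

```python
import random


def best_stump_accuracy(values, labels):
    """Best accuracy achievable by thresholding one feature (either direction)."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    total_pos = sum(label for _, label in pairs)
    # Baseline: always predict the majority class.
    best = max(total_pos, n - total_pos) / n
    pos_below = 0
    for i in range(n - 1):
        pos_below += pairs[i][1]
        if pairs[i][0] == pairs[i + 1][0]:
            continue  # no threshold fits between two equal values
        neg_below = (i + 1) - pos_below
        pos_above = total_pos - pos_below
        # Rule: predict 0 below the threshold and 1 above it.
        correct = neg_below + pos_above
        # The inverted rule gets exactly the remaining samples right.
        best = max(best, correct / n, (n - correct) / n)
    return best


def rank_features(X, y):
    """Return (feature_index, stump_accuracy) pairs, most separable first."""
    scores = []
    for j in range(len(X[0])):
        column = [row[j] for row in X]
        scores.append((j, best_stump_accuracy(column, y)))
    return sorted(scores, key=lambda s: s[1], reverse=True)


def make_synthetic(n=200, n_features=13, shift=5.0, seed=0):
    """Synthetic stand-in for MFCC vectors; feature 0 leaks the label."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2  # 0 = "real", 1 = "AI-generated" (assumed encoding)
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        row[0] += shift * label  # the planted shortcut
        X.append(row)
        y.append(label)
    return X, y


X, y = make_synthetic()
ranking = rank_features(X, y)
for j, acc in ranking[:3]:
    print(f"feature {j}: single-threshold accuracy {acc:.2f}")
```

If one coefficient scores close to 1.0 on the real features while the rest sit near chance, that coefficient is a likely candidate for the shortcut. Dropping or normalizing it and retraining would show how much of the performance depends on it. This only catches shortcuts visible in a single feature; leakage from how the data is split (for example, clips from the same song in both train and test) would need a separate check.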
u/Intrepid-Fish-289 — 4 days ago