基於時頻感知域經由深度信念網路之吉他彈奏技巧辨識
吉他是非常常見的樂器,被廣泛運用於流行音樂、搖滾樂、民謠…等,學習吉他成為許多人的興趣。而不同吉他彈奏技巧能夠表現不同聲音、展示不同情緒,進而拼湊成一幅樂章。
吉他彈奏技巧的變化相當細微,欲將其分類、辨識是具有挑戰性的工作。對於不熟悉吉他的人而言,技巧聽起來十分相像;而會彈吉他的人,便能單憑聆聽就區分出不同技巧。
面對彈奏技巧些微的變化,本研究提出以深度學習網路(Deep Belief Networks, DBN)學習音訊特徵,包含梅爾倒頻譜系數(MFCCs)及大腦皮質組織(spectro-temporal receptive field),藉由不同初始化方法與新提出的深度學習網路架構,學習找出相對關鍵的特徵增加辨識效果,並使用完整音檔和Onset部分進行比較。實驗結果顯示,本研究提出之方法於Onset部分最高提升11.74%之辨識率,而完整音檔的部分,辨識率更為精準,到達0.9819。說明有效運用特徵參數及辨認器,相較於大量參數,更能準確分類資訊。
關鍵字: 聽覺模型,吉他彈奏技巧,分類,辨識,類神經網路,深度學習
Recognition of Guitar Playing Techniques with Deep Belief Networks based on Spectral-Temporal Receptive Fields
Abstract
Guitar is a very common instrument which has been widely used in popular music, rock, ballad, etc. Different guitar playing technique can perform various vocal, express different emotion, then play the wonderful music. Some of guitar playing techniques has only tiny difference. To recognize it is a big challenge. This paper proposed a guitar playing technique recognition system including a novel STRF based feature extraction algorithm and a novel deep learning model called HCDBN. In experiments, the proposed system improves 11.74% recognition rate than baseline system on onset version dataset and achieves 98.19% recognition rate on whole version dataset. This paper also make an onset detection based guitar technique recognition system which can applied in real world guitar solo music.
Keywords: STRF, Guitar Playing Technique, Classification, Recognition, Neural Network, Deep Belief Network