Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 4 of 4

1	What does the Canary Say? Low-Dimensional GAN Applied to Birdsong
	Pagliarini, Silvia; Trouvain, Nathan; Leblois, Arthur...
	In: https://hal.inria.fr/hal-03244723 ; 2021 (2021)
	BASE
	Show details

2	Which Hype for my New Task? Hints and Random Search for Reservoir Computing Hyperparameters
	Hinaut, Xavier; Trouvain, Nathan
	In: ICANN 2021 - 30th International Conference on Artificial Neural Networks ; https://hal.inria.fr/hal-03203318 ; ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia (2021)
	BASE
	Show details

3	Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs
	Trouvain, Nathan; Hinaut, Xavier
	In: https://hal.inria.fr/hal-03203374 ; 2021 (2021)
	BASE
	Show details

4	Canary Song Decoder: Transduction and Implicit Segmentation with ESNs and LTSMs
	Trouvain, Nathan; Hinaut, Xavier
	In: ICANN 2021 - 30th International Conference on Artificial Neural Networks ; https://hal.inria.fr/hal-03203374 ; ICANN 2021 - 30th International Conference on Artificial Neural Networks, Sep 2021, Bratislava, Slovakia. pp.71--82, ⟨10.1007/978-3-030-86383-8_6⟩ ; https://link.springer.com/chapter/10.1007/978-3-030-86383-8_6 (2021)
	Abstract: International audience ; Domestic canaries produce complex vocal patterns embed- ded in various levels of abstraction. Studying such temporal organization is of particular relevance to understand how animal brains represent and process vocal inputs such as language. However, this requires a large amount of annotated data. We propose a fast and easy-to-train trans- ducer model based on RNN architectures to automate parts of the anno- tation process. This is similar to a speech recognition task. We demon- strate that RNN architectures can be efficiently applied on spectral fea- tures (MFCC) to annotate songs at time frame level and at phrase level. We achieved around 95% accuracy at frame level on particularly complex canary songs, and ESNs achieved around 5% of word error rate (WER) at phrase level. Moreover, we are able to build this model using only around 13 to 20 minutes of annotated songs. Training time takes only 35 seconds using 2 hours and 40 minutes of data for the ESN, allowing to quickly run experiments without the need of powerful hardware.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; [SCCO.LING]Cognitive science/Linguistics; [SDV.NEU]Life Sciences [q-bio]/Neurons and Cognition [q-bio.NC]; Audio Classification; Birdsong; Echo State Networks; Long Short Terms Memory; MFCC; RNN
	URL: https://hal.inria.fr/hal-03203374 https://hal.inria.fr/hal-03203374v2/document https://doi.org/10.1007/978-3-030-86383-8_6 https://hal.inria.fr/hal-03203374v2/file/TrouvainHinaut2021_ICANN_Canary-decoder_HAL-v2.pdf
	BASE
	Hide details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern