
Search in the Catalogues and Directories

Hits 1 – 5 of 5

1
Hand-gesture recognition based on EMG and event-based camera sensor fusion: a benchmark in neuromorphic computing
In: Frontiers in Neuroscience, Frontiers, 2020, pp. 36; ISSN: 1662-4548; EISSN: 1662-453X; https://hal.archives-ouvertes.fr/hal-02617084; https://www.frontiersin.org/ (2020)
BASE
2
Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing
In: Frontiers in Neuroscience, 14 (2020)
BASE
3
Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing
In: Front Neurosci (2020)
BASE
4
Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing
In: Ceolini, Enea; Frenkel, Charlotte; Shrestha, Sumit Bam; Taverni, Gemma; Khacef, Lyes; Payvand, Melika; Donati, Elisa (2020). Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing. Frontiers in Neuroscience, 14:637. (2020)
BASE
5
Written and spoken digits database for multimodal learning
In: https://hal.archives-ouvertes.fr/hal-02327938 ; 2019 (2019)
Abstract: The written and spoken digits database is not a new database but one constructed from existing databases, in order to provide a ready-to-use database for multimodal fusion. The written digits database is the original MNIST handwritten digits database [1] with no additional processing. It consists of 70000 images (60000 for training and 10000 for test) of 28 x 28 = 784 dimensions. The spoken digits database was extracted from Google Speech Commands [2], an audio dataset of spoken words proposed for training and evaluating keyword spotting systems. It consists of 105829 utterances of 35 words, amongst which 38908 utterances of the ten digits (34801 for training and 4107 for test). Pre-processing was done by extracting the Mel Frequency Cepstral Coefficients (MFCC) with a framing window size of 50 ms and a frame shift size of 25 ms. Since the speech samples are approximately 1 s long, this yields 39 time slots. For each slot, 12 MFCC coefficients plus one energy coefficient are extracted, giving a final vector of 39 x 13 = 507 dimensions. Standardization and normalization were applied to the MFCC features. To construct the multimodal digits dataset, written and spoken digits of the same class were associated, respecting the initial training/test partitioning of [1] and [2]. Since there are fewer spoken-digit samples, some random samples are duplicated to match the number of written digits, giving a multimodal digits database of 70000 samples (60000 for training and 10000 for test). The dataset is provided in six files; therefore, if a shuffle is performed on the training or test subsets, it must be performed in unison, with the same order for the written digits, spoken digits, and labels (see the sketch after this record).
Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-AR]Computer Science [cs]/Hardware Architecture [cs.AR]
URL: https://hal.archives-ouvertes.fr/hal-02327938
BASE
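
A minimal sketch (not part of the record) of how one might pair the written and spoken digits by class and shuffle the training subset in unison, as the abstract requires. The array names, the stand-in random data, and the helper functions are illustrative assumptions; the record does not name the six provided files, so loading them is left out.

import numpy as np

rng = np.random.default_rng(0)

def pair_by_class(written_labels, spoken, spoken_labels):
    """Associate one spoken digit of the same class with every written digit.

    Spoken samples are fewer, so some random samples of each class are
    duplicated until the count matches the written digits (as in the record).
    """
    paired = np.empty((written_labels.size, spoken.shape[1]), dtype=spoken.dtype)
    for digit in range(10):
        w_idx = np.flatnonzero(written_labels == digit)
        s_idx = np.flatnonzero(spoken_labels == digit)
        if w_idx.size > s_idx.size:
            extra = rng.choice(s_idx, size=w_idx.size - s_idx.size, replace=True)
            chosen = np.concatenate([s_idx, extra])
        else:
            chosen = rng.choice(s_idx, size=w_idx.size, replace=False)
        rng.shuffle(chosen)
        paired[w_idx] = spoken[chosen]
    return paired

def shuffle_in_unison(written, spoken, labels):
    """Apply one shared permutation so both modalities stay aligned with the labels."""
    perm = rng.permutation(labels.size)
    return written[perm], spoken[perm], labels[perm]

# Stand-in arrays with the dimensions stated in the abstract
# (784-dim MNIST images, 507-dim MFCC vectors); real data would be
# loaded from the six files provided with the dataset.
written_train = rng.random((60000, 784), dtype=np.float32)
written_labels = rng.integers(0, 10, size=60000)
spoken_pool = rng.random((34801, 507), dtype=np.float32)
spoken_labels = rng.integers(0, 10, size=34801)

spoken_train = pair_by_class(written_labels, spoken_pool, spoken_labels)
written_train, spoken_train, written_labels = shuffle_in_unison(
    written_train, spoken_train, written_labels
)

The single shared permutation is the point the record emphasises: shuffling each file independently would break the alignment between images, MFCC vectors, and labels.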

Source overview: all 5 hits are open access documents (BASE); no hits in catalogues, bibliographies, Linked Open Data catalogues, or online resources.