de
en
Close
Detailsuche
Bibliotheken
Projekt
Imprint
Privacy Policy
Close
Publizieren
Besondere Sammlungen
Digitalisierungsservice
Hilfe
Impressum
Datenschutz
jump to main content
Search Details
Quicksearch:
OK
Result-List
Title
Title
Content
Content
Page
Page
Search Book
Robust multi-channel speech recognition with neural network supported statistical beamforming / von M.Sc. Jahn Heymann ; Erster Gutachter: Prof. Dr.-Ing. Haeb-Umbach, Zweiter Gutachter: Priv.-Doz. Dr. rer. nat. Ralf Schlüter. Paderborn, 2020
Content
Acknowledgement
Abstract
Zusammenfassung
Introduction
Neural networks
Layers
Fully connected layers
Convolutional layers
Recurrent layers
Training
Update rule
Backpropagation
Computational graphs and automatic differentiation
Wirtinger calculus
Summary
Automatic speech recognition
Statistical ASR
Feature extraction
Acoustic model
Language model
Decoder
End-to-end ASR
Sequence-to-sequence models
RNN transducer
Multi-channel ASR
Summary
Speech signal processing
Signal model
General model
Simplified model
Spectral model
Acoustic beamforming
Physical model
Statistical model
Spatial covariance matrix estimation
Statistical mask estimation
Dereverberation
Dereverberation with beamforming
Weighted predictive error
Summary
Datasets, setup and baselines
Datasets
CHiME
REVERB
Evaluation setup
Baselines
Summary
Contributions
Robust multi-channel ASR with neural network supported beamforming
Neural network mask estimation
Wide Residual BLSTM Network acoustic model
Training
Mask estimator
Acoustic model
Evaluation
Acoustic model
cACGMM vs. neural network based mask estimator
Performance over SNR
Comparison of different beamformers
Combination with WPE
Comparison between BLSTM and U-Net
Array independence
Related work
Summary
Reducing latencies
Mask estimator
LSTM
Instance norm feedforward network
Scale invariant feedforward network
Beamformer
ASR back-end
Evaluation
Online mask estimation
Online front-end
Online system
Discussion
Summary
Unsupervised neural mask estimator training
cACGMM likelihood loss
cACGMM teacher
Evaluation
Optimization criterion
Comparison with oracle target training
Softmax vs. Sigmoid
Comparison with teacher-student training
Summary
Joint optimization
Backpropagating gradients
Evaluation
Initial experiment
Research questions
Impact of the training data
Model analysis
Performance on REVERB
Answers
Summary
Summary
Appendix
Gradient for the Cholesky factorization
Gradient for the complex valued eigenvalue decomposition
Reproducibility
Symbols and notation
List of Figures
List of Tables
Acronyms
Bibliography
Own publications
The search-operation requires javascript to be activated.