Speaker Diarization

Found super interesting article about speaker diarization. Guys build full framework to separate speakers on audio online.
The paper can be found here.
The repository is here.

Other interesting articles on topic:

  1. https://www.infoq.com/news/2018/11/Google-AI-Voice/
  2. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5996252/
  3. http://theconversation.com/human-voices-are-unique-but-our-study-shows-were-not-that-good-at-recognising-them-79520
  4. https://towardsdatascience.com/automatic-speaker-recognition-using-transfer-learning-6fab63e34e74
  5. https://medium.com/@ageitgey/machine-learning-is-fun-part-6-how-to-do-speech-recognition-with-deep-learning-28293c162f7a
  6. https://medium.com/linagoralabs/voice-activity-detection-for-voice-user-interface-2d4bb5600ee3
  7. https://medium.com/linagoralabs/computing-mfccs-voice-recognition-features-on-arm-systems-dae45f016eb6
  8. https://towardsdatascience.com/beginners-guide-to-speech-analysis-4690ca7a7c05
  9. https://towardsdatascience.com/speech-recognition-is-hard-part-1-258e813b6eb7
  10. https://medium.com/linagoralabs/voice-activity-detection-for-voice-user-interface-2d4bb5600ee3
  11. https://towardsdatascience.com/ok-google-how-to-do-speech-recognition-f77b5d7cbe0b
  12. https://towardsdatascience.com/recognizing-speech-commands-using-recurrent-neural-networks-with-attention-c2b2ba17c837

Comments

Popular posts from this blog

Install Kubeflow locally

RabbitMQ and OpenShift