Our client is looking for a Data Scientist with experience working with Big Data pipelines and tools in a linux-based environment to train, improve and evaluate product-grade language packs for their Voice Biometric solutions. This is an opportunity to work as a Data and Language engineer, working closely with a team of speech experts within a dedicated team to train and productise models for their Voice Biometric solutions.
We are looking for someone with a degree in Computer Sciences, Computational Linguistics, Speech Recognition, Machine Learning, or similar fields like Physics or Mathematics, or equivalent work experience.
The ideal candidate would also have:
- Experience with creation and handling big data pipelines, pre- and post-processing of data.
- Experience with languages and tools such as Matlab, Python, Perl, etc.
- Fluency with Unix/Linux, shell scripting.
- Understanding of speech recognition, voice biometrics technology, machine learning and deep neural networks is a plus.