Last Updated: 5th March 2023
N. Takahashi, M. K. Singh and Y. Mitsufuji, "Cross-modal Face- and Voice-style Transfer"
N. Takahashi, M. K. Singh and Y. Mitsufuji, "Robust One-Shot Singing Voice Conversion"
I am currently a Research Scientist at Sony Research India, following a two-year stint at Sony Japan R&D labs. I have been intrigued by machine learning since high school, and my vision is to understand the complex decision-making process of humans and implement it with the technologies we have, pushing humanity one step further.
I started my journey with hyperspectral images for cancer detection and pixel-level segmentation, and have since worked on speech source separation, text detection and recognition, quant algorithms, reinforcement learning, automatic speech recognition, singing/emotional speech voice conversion, vocoders, and audio steganography. It has been a great six years exploring this field.