Senior researcher, consultant, software developer and lecturer, engaged in multiple commercial and scientific research projects (H2020, ERASMUS+, EUREKA, etc.) focused on cutting-edge speech technologies, speech and image processing, human-computer interaction, deep learning algorithms and artificial intelligence. Leading researcher in the area of automatic speech recognition and head of the ASR team. As a chief programmer and research consultant, contributed to the development of the first high-quality speech synthesizer for Hebrew. As a researcher, software developer and consultant, made a key contribution to the development of ASR systems for Serbian and kindred South Slavic languages and a voice assistant mobile application for the Serbian language. Managed various project activities in technical development projects. Previously engaged part-time as an associate professor and a vice-dean for artistic and scientific-research work at the Academy of Arts in Belgrade. Founder and owner of the Computer Programming Agency Code85.
Interested in the broad fields of machine learning, data mining and human-computer interaction. Experienced in various IDEs, programming languages and software tools. Published numerous scientific papers in journals and conference proceedings, and authored internationally applied technical solutions and patents. Member of scientific committees of several national and international conferences. Acted as a reviewer for a number of international scientific journals. Obtained a Ph.D. degree as the youngest doctor of technical sciences at the Faculty of Technical Sciences in Novi Sad. Gained over a decade of practical and industrial experience. Possesses outstanding social skills and organizational abilities, and a strong propensity for design. Regular member of the Centre of Excellence CEVAS, a group for Acoustics and Speech Technology. Leader of the innovation working group of the Serbian AI Society.
Project "Innovative Scientific and Artistic Research from the Faculty of Technical Sciences Activity Domain", MESTD No. 451-03-68/2020-14/200156 (January 2020 - Present)
Project "MARVEL: Multimodal Extreme Scale Data Analytics for Smart Cities Environments", Grant agreement ID 957337, H2020-EU.2.1.1. (January 2021 - Present)
Project "S-ADAPT: Speaker/Style Adaptation for Digital Voice Assistants Based on Image Processing Methods", Grant No. 6524560, Science Fund of the Republic of Serbia (September 2020 - February 2023)
- Head of the ASR team
Project "SENVIBE: Strengthening Educational Capacities by Building Competences and Cooperation in the Field of Noise and Vibration Engineering", no. 598241-EPP-1-2018-1-RS-EPPKA2-CBHE-JP (November 2018 - November 2022)
- Key staff member
Project "Development of Dialogue Systems for Serbian and Other South Slavic Languages", id TR32035 (January 2011 - December 2019)
- Leader of several project activities (LVCSR, VAD)
Project "DANSPLAT: A Platform for the Applications of Speech Technologies on Smartphones for the Languages of the Danube Region", id E! 9944 (January 2016 - January 2019)
- Senior researcher
Project "SP2: SCOPES Project on Speech Prosody", SNSF no. IZ73Z0_152495 (April 2014 - March 2016)
Project "S-VERIFY: Advanced Speaker Verification", id E! 8719 (January 2014 - September 2016)
Project "Central Audio-Library of the University of Novi Sad (CABUNS)", PSNTR No. 114-451-2570/2016-02 (May 2016 - March 2020)
Project "Audio Library for the Disabled (ABOSI)", PSNTR No. 114-451-2210/2011-04 (May 2015 - December 2015)
Description
Continuous speech recognition, language and acoustic modelling, image processing, emotion recognition, machine learning algorithms, deep learning, data mining, and artificial intelligence.
Head of the ASR team.
Project management (implementation, structure, organization, budget).
Occasionally engaged in teaching (Human-Machine Speech Communication, Selected Chapters in Acoustics and Audio Engineering, Acoustics and Audio Engineering, Acoustics and Audio Engineering in Multimedia, Digital Audio Signal Processing, Optical Telecommunications, Electroacoustics).
Regular member of the Centre for Vibro-Acoustic Systems and Signal Processing (CEVAS), group for Acoustics and Speech Technology, accredited as a Centre of Excellence by the National Council for Scientific and Technological Development of the Republic of Serbia on 18 May 2015 and again on 26 February 2020 (October 2014 - Present). Member of the IEEE Computational Intelligence Society and the IEEE Computer Society (December 2015 - Present).
Project "MEDICTA: Development of Systems for Dictation of Medical Findings in Bosnian/Croatian/Serbian including Latin Expressions", Grant agreement no. 825003, Horizon 2020, DIH-HERO Technology Transfer Experiment Call 2020 (2021 - 2022)
Project "Digital Audio Signals Processing", AlfaNum - Speech Technologies Ltd (December 2017 - Present)
Project "Automatic Speech Recognition System for Dictating Medical Findings", Pension and Disability Insurance Fund of the Republic of Serbia, Contract no. 404.3-399/19 (August 2019 - December 2020)
- Authorized representative
Products and services
Voice Assistant Application for the Serbian Language, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia)
Automatic Speech Recognition System for Dictating Medical Findings, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia) for the Pension and Disability Insurance Fund of the Republic of Serbia (client)
"100 reasons for 1 click", ASR server for IVR, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia) in cooperation with partners for the Government of the Republic of Serbia (client)
"MEDICTA", a system for dictation of medical findings in Bosnian/Croatian/Serbian including Latin expressions, DIH-HERO Technology Transfer Experiment Call 2020, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia)
Description
Computer programming and consulting services, application design (GUI and architecture) and development.
Signal processing, classification and recognition, data analysis, deep learning, artificial intelligence.
Description
The Serbian Artificial Intelligence Society (SAIS) is a society promoting AI research and the development of applications in the artificial intelligence industry. Members are Serbian AI companies, researchers, decision-makers, entrepreneurs, organizations, professionals and students active in, or interested in, the area of artificial intelligence.
Description
Providing appropriate conditions for artistic and scientific research activities, monitoring and analysis of results.
Preparing curricula and reports, harmonizing contents and results with the adopted strategic plans, national and EU standards of higher education and their inclusion in the teaching process, corrective measures and publishing activities.
Engaged in lecturing (Audio Engineering, Physical and Physiological Acoustics, Electroacoustics, Applied Acoustics, Spatial Acoustics with Sound Reinforcement).
Description
Engaged in lecturing (Artificial Intelligence, Multimedia Information Systems).
Description
Continuous speech recognition and synthesis, speaker identification, human-computer interaction, speech segmentation (for Speech Morphing, Inc.).
Coaching, organization and supervision of ASR team members.
Software development and design for Windows / Linux / Android.
Description
Engaged in teaching and laboratory practice (Audio Engineering).
Description
Phrase analysis and evaluation, input reception, lexicon retrieval, preprocessing, part-of-speech tagging, reading selection, phonetic reconstruction, data encryption, code optimization, end-user applications (Aharon TTS).
In cooperation with the Faculty of Technical Sciences, Novi Sad and AlfaNum - Speech Technologies Ltd.
Project "Development of Dialogue Systems for Serbian and Other South Slavic Languages", id TR32035 (January 2011 - December 2019)
Project "Human-Machine Speech Communication", id TR11001 (October 2009 - December 2010)
Description
Research and development, clustering algorithms, digital signal processing, advanced statistics, emotion recognition, speech and image processing, speech recognition (ASR) and speech synthesis (TTS) for Serbian and Hebrew.
Engaged in the training of young researchers.
Periodically engaged as a teaching assistant up to eight hours per week (Automatic Speech Recognition and Synthesis, Design of Spatial Forms).
Core Network and Services, Network Operations
Description
GSM, UMTS, WCDMA, SMS, MMS, SS7 protocol, wireless network architecture, roaming, call tracking, base station repairs and integration, creating reports, on-site training.
Computer engineering department (RT-RK)
Description
Measurement of harmonics at low SNR, SAADK converters, DSP.