Experienced senior researcher, consultant, software developer, and lecturer, actively participating in diverse commercial and scientific research initiatives, specialized in cutting-edge speech technologies, speech and image processing, HCI, deep learning algorithms, and AI. Currently holds the position of a leading researcher and head of the automatic speech recognition (ASR) team, playing a pivotal role in crafting ASR systems tailored for Serbian and related South Slavic languages, including their commercial applications for medical and juridical dictation, and a voice assistant mobile application, alongside the creation of various speech resources. Notable achievements also include pioneering the first high-quality speech synthesizer for Hebrew. Managed multiple project activities within technical development projects. Previously served as an associate professor and a vice-dean for artistic and scientific research work at the Academy of Arts in Belgrade. Founder and owner of the Computer Programming Agency Code85.
A dedicated professional interested in the broad fields of machine learning, data mining, and human-computer interaction. Proficient in various IDEs, programming languages, and software tools. Author of numerous scientific papers in esteemed journals and conference proceedings, as well as internationally applied technical solutions and patents. Actively contributes to the scientific community as a member of scientific committees for several national and international conferences and as a reviewer for international scientific journals. Holds a Ph.D. degree as the youngest doctor of technical sciences from the Faculty of Technical Sciences in Novi Sad. Demonstrates exceptional social skills, organizational abilities, and a strong propensity for design. With more than a decade of practical and industrial experience, serves as a regular member of the Centre of Excellence CEVAS, group for Acoustics and Speech Technology, and as the leader of the innovation working group of the Serbian AI Society.
Project "ELOQUENCE: Multilingual and Cross-Cultural Interactions for Context-Aware, and Bias-Controlled Dialogue Systems for Safety-Critical Applications", Grant agreement ID 101135916, HORIZON-CL4-2023-HUMAN-01-CNECT (January 2024 - Present)
- Project coordinator (UNS)
Project "AI-SPEAK: Multimodal Multilingual Human-Machine Speech Communication", Grant No. 7449, Science Fund of the Republic of Serbia (January 2024 - Present)
- Head of the ASR team
Project "Innovative Scientific and Artistic Research from the Faculty of Technical Sciences Activity Domain", MESTD No. 451-03-68/2020-14/200156 (January 2020 - December 2023)
Project "MARVEL: Multimodal Extreme Scale Data Analytics for Smart Cities Environments", Grant agreement ID 957337, H2020-EU.2.1.1. (January 2021 - December 2023)
Project "S-ADAPT: Speaker/Style Adaptation for Digital Voice Assistants Based on Image Processing Methods", Grant No. 6524560, Science Fund of the Republic of Serbia (September 2020 - February 2023)
- Head of the ASR team
Project "SENVIBE: Strengthening Educational Capacities by Building Competences and Cooperation in the Field of Noise and Vibration Engineering", no. 598241-EPP-1-2018-1-RS-EPPKA2-CBHE-JP (November 2018 - November 2022)
- Key staff member
Project "Development of Dialogue Systems for Serbian and Other South Slavic Languages", id TR32035 (January 2011 - December 2019)
- Leader of several project activities (LVCSR, VAD)
Project "DANSPLAT: A Platform for the Applications of Speech Technologies on Smartphones for the Languages of the Danube Region", id E! 9944 (January 2016 - January 2019)
- Senior researcher
Project "SP2: SCOPES Project on Speech Prosody", SNSF no. IZ73Z0_152495 (April 2014 - March 2016)
Project "S-VERIFY: Advanced Speaker Verification", id E! 8719 (January 2014 - September 2016)
Project "Central Audio-Library of the University of Novi Sad (CABUNS)", PSNTR No. 114-451-2570/2016-02 (May 2016 - March 2020)
Project "Audio Library for the Disabled (ABOSI)", PSNTR No. 114-451-2210/2011-04 (May 2015 - December 2015)
Description
Continuous speech recognition, language and acoustic modeling, image processing, emotion recognition, machine learning algorithms, deep learning, data mining, and artificial intelligence.
Head of the ASR team.
Project management (implementation, structure, organization, budget).
Occasionally engaged in teaching (Human-Machine Speech Communication, Selected Chapters in Acoustics and Audio Engineering, Acoustics and Audio Engineering, Acoustics and Audio Engineering in Multimedia, Digital Audio Signal Processing, Optical Telecommunications, Electroacoustics).
Regular member of the Centre for Vibro-Acoustic Systems and Signal Processing (CEVAS), group for Acoustics and Speech Technology, accredited as a Centre of Excellence by the National Council for Scientific and Technological Development of the Republic of Serbia on 18 May 2015 and again on 26 February 2020 (October 2014 - Present). Member of the IEEE Computational Intelligence Society and the IEEE Computer Society (December 2015 - Present).
Project "Digital Audio Signals Processing", AlfaNum - Speech Technologies Ltd (December 2017 - Present)
Project "MEDICTA: Development of Systems for Dictation of Medical Findings in Bosnian/Croatian/Serbian including Latin Expressions", Grant agreement no. 825003, Horizon 2020, DIH-HERO Technology Transfer Experiment Call 2020 (2021 - 2022)
Project "Automatic Speech Recognition System for Dictating Medical Findings", Pension and Disability Insurance Fund of the Republic of Serbia, Contract no. 404.3-399/19 (August 2019 - December 2020)
- Authorized representative
Products and services
Voice Assistant Application for the Serbian Language, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia)
Automatic Speech Recognition System for Dictating Medical Findings, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia) for the Pension and Disability Insurance Fund of the Republic of Serbia (client)
"100 reasons for 1 click", ASR server for IVR, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia) in cooperation with partners for the Government of the Republic of Serbia (client)
"MEDICTA", a system for dictation of medical findings in Bosnian/Croatian/Serbian including Latin expressions, DIH-HERO Technology Transfer Experiment Call 2020, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia)
"MEDICTA", a system for dictation of medical findings, including stripe mode, dictionary, templates, and personalization options, developed by AlfaNum - Speech Technologies Ltd (Novi Sad, Serbia)
Description
Computer programming and consulting services, application design (GUI and architecture) and development.
Signal processing, classification, and recognition, data analyses, deep learning, artificial intelligence.
Description
The Serbian Artificial Intelligence Society (SAIS) is a society promoting AI research and the development of applications in the artificial intelligence industry. Its members are Serbian AI companies, researchers, decision-makers, entrepreneurs, organizations, professionals, and students active in, or interested in, the area of artificial intelligence.
Description
Axioms is an international, peer-reviewed, open-access journal of mathematics, mathematical logic and mathematical physics, published monthly online by MDPI.
Editor of the special issue entitled "Recent Advances of Computational and Mathematical Applications in Deep Learning".
Description
Creating appropriate conditions for artistic and scientific research activities, and overseeing and analysing their outcomes.
Developing curricula and reports, harmonizing contents and results with established strategic plans, national and EU higher education standards, and integrating them into the teaching process.
Implementing corrective measures and participating in publishing activities.
Engaged in lecturing (Audio Engineering, Physical and Physiological Acoustics, Electroacoustics, Applied Acoustics, Spatial Acoustics with Sound Reinforcement).
Description
Engaged in lecturing (Artificial Intelligence, Multimedia Information Systems).
Description
Continuous speech recognition and synthesis, speaker identification, human-computer interaction, speech segmentation (for Speech Morphing, Inc.).
Coaching, organization, and supervision of ASR team members.
Software development and design for Windows / Linux / Android.
Description
Engaged in teaching and laboratory practice (Audio Engineering).
Description
Phrase analysis and evaluation, input reception, lexicon retrieval, preprocessing, part-of-speech tagging, reading selection, phonetic reconstruction, data encryption, code optimization, end-user applications (Aharon TTS).
In cooperation with the Faculty of Technical Sciences, Novi Sad and AlfaNum - Speech Technologies Ltd.
Project "Development of Dialogue Systems for Serbian and Other South Slavic Languages", id TR32035 (January 2011 - December 2019)
Project "Human-Machine Speech Communication", id TR11001 (October 2009 - December 2010)
Description
Research and development, clustering algorithms, digital signal processing, advanced statistics, emotion recognition, speech and image processing, speech recognition (ASR), and speech synthesis (TTS) for Serbian and Hebrew.
Engaged in the training of young researchers.
Periodically engaged as a teaching assistant up to eight hours per week (Automatic Speech Recognition and Synthesis, Design of Spatial Forms).
Core Network and Services, Network Operations
Description
GSM, UMTS, WCDMA, SMS, MMS, SS7 protocol, wireless network architecture, roaming, call tracking, base station repairs and integration, creating reports, on-site training.
Computer engineering department (RT-RK)
Description
Measurement of harmonics at low SNR, SAADK converters, DSP.