A Survey of Audio Synthesis and Lip-syncing for Synthetic Video Generation

Anup Kadam; Sagar Rane; Arpit Kumar Mishra; Shailesh Kumar Sahu; Shubham Singh; Shivam Kumar Pathak

doi:10.4108/eai.14-4-2021.169187

Authors

Anup Kadam, Indian Institute of Information Technology, Pune
Sagar Rane, Indian Institute of Information Technology, Pune
Arpit Kumar Mishra, Indian Institute of Information Technology, Pune
Shailesh Kumar Sahu, Indian Institute of Information Technology, Pune
Shubham Singh, Indian Institute of Information Technology, Pune
Shivam Kumar Pathak, Indian Institute of Information Technology, Pune

DOI:

https://doi.org/10.4108/eai.14-4-2021.169187

Keywords:

Video Synthesis, Voice Cloning, Lip Synchronization, Video Generation Application

Abstract

Fields such as media, education, and the corporate sector have begun to focus on content creation, which has created strong demand for synthetic media generation from limited data. To synthesize a high-quality artificial video, the lip movements must be synchronized with the audio. This survey compares methods for voice cloning and lip synchronization. The voice cloning methods covered include state-of-the-art approaches such as WaveNet and other text-to-speech approaches. The lip synchronization methods covered include both constrained and unconstrained approaches, and recent work such as LipGAN and Wav2Lip is discussed. The methods are compared and the best method is suggested. In addition to studying and comparing the methods, the survey covers their drawbacks, future scope, and applications. Social and ethical issues are also discussed.

Published

14-04-2021

How to Cite

Kadam A, Rane S, Mishra AK, Sahu SK, Singh S, Pathak SK. A Survey of Audio Synthesis and Lip-syncing for Synthetic Video Generation. EAI Endorsed Trans Creat Tech [Internet]. 2021 Apr. 14 [cited 2025 Jun. 18];8(28):e2. Available from: https://publications.eai.eu/index.php/ct/article/view/1417

Issue

Vol. 8 No. 28 (2021): EAI Endorsed Transactions on Creative Technologies

Section

Research article

License

This is an open-access article distributed under the terms of the Creative Commons Attribution CC BY 3.0 license, which permits unlimited use, distribution, and reproduction in any medium so long as the original work is properly cited.
