Researchers from Nanyang Technological University (NTU) in Singapore have unveiled DIverse yet Realistic Facial Animations (DIRFA), a computer program that crafts lifelike videos with realistic facial expressions and head movements.

The innovative artificial intelligence (AI) tool generates 3D videos from just an audio clip and a face photo, producing facial animations that are realistic and synchronized with the spoken audio.

(Photo: OLIVIER DOULIERY/AFP via Getty Images) This illustration photo, taken on January 30, 2023, in Washington, DC, shows a phone screen displaying a statement from Meta's head of security policy, with a fake video of Ukrainian President Volodymyr Zelensky calling on his soldiers to lay down their weapons shown in the background.

Training DIRFA To Make Realistic Talking Faces

Generating realistic facial expressions from audio is a complex challenge because a single audio signal can correspond to many plausible animations, especially over temporal sequences. The team's objective was to develop talking faces that exhibit precise lip synchronization, rich facial expressions, and natural head movements corresponding to the provided audio.

While audio has strong associations with lip movements, its connections with facial expressions and head positions are weaker. This gap prompted the creation of DIRFA, an AI model designed to capture the intricate relationships between audio signals and facial animations.

To tackle this complexity, the team trained DIRFA on an extensive dataset of over one million audio and video clips from more than 6,000 individuals, drawn from a publicly available database. The model was engineered to predict the likelihood of specific facial animations, such as raised eyebrows or wrinkled noses, based on the input audio.

This approach allows DIRFA to transform audio input into diverse, highly realistic sequences of facial animations, guiding the generation of talking faces with accurate lip movements, expressive facial features, and natural head poses.
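
The article does not detail DIRFA's underlying architecture, but the core idea of predicting a likelihood distribution over facial animations from audio can be illustrated with a short sketch. The Python example below is a hypothetical simplification: the class name, layer choices, and dimensions are assumptions for illustration, not the published DIRFA design.

```python
# Hypothetical sketch: mapping audio features to a distribution over
# facial-animation parameters. Names and dimensions are illustrative
# assumptions, not DIRFA's published architecture.
import torch
import torch.nn as nn

class AudioToAnimation(nn.Module):
    def __init__(self, audio_dim=80, hidden_dim=256, anim_dim=64):
        super().__init__()
        # Encode a sequence of audio frames (e.g., mel-spectrogram frames).
        self.encoder = nn.GRU(audio_dim, hidden_dim, batch_first=True)
        # Predict a mean and log-variance for each animation parameter,
        # modeling the likelihood of facial movements (raised eyebrows,
        # lip shapes, head pose) rather than one deterministic answer.
        self.mean_head = nn.Linear(hidden_dim, anim_dim)
        self.logvar_head = nn.Linear(hidden_dim, anim_dim)

    def forward(self, audio_frames):
        # audio_frames: (batch, time, audio_dim)
        hidden, _ = self.encoder(audio_frames)
        return self.mean_head(hidden), self.logvar_head(hidden)

    def sample(self, audio_frames):
        # Draw one plausible animation sequence from the predicted
        # distribution; repeated samples yield diverse yet
        # audio-consistent animations.
        mean, logvar = self.forward(audio_frames)
        std = torch.exp(0.5 * logvar)
        return mean + std * torch.randn_like(std)

# Usage sketch: two clips, 100 audio frames each.
model = AudioToAnimation()
audio = torch.randn(2, 100, 80)
animation = model.sample(audio)  # shape: (2, 100, 64)
```

Because such a model samples from a learned distribution rather than returning a single fixed output, the same audio clip can produce several different but plausible animation sequences, mirroring the "diverse yet realistic" behavior the DIRFA name describes.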

According to Associate Professor Lu Shijian, the study's corresponding author from NTU's School of Computer Science and Engineering (SCSE), DIRFA's modeling capabilities enable the program to generate talking faces that authentically synchronize with audio, offering accurate lip movements and natural expressions. The team is actively refining DIRFA's interface to give users more control over specific outputs.

Notably, users will soon be able to adjust certain expressions, such as turning a frown into a smile. The researchers also plan to enhance DIRFA by training it on a wider range of datasets, covering more diverse facial expressions and voice audio clips, for more comprehensive and versatile applications.


DIRFA Program Enhances Facial Animations with AI and Machine Learning

NTU's DIRFA program addresses challenges in existing methods related to pose variation and emotional control. The researchers envision diverse applications for DIRFA, including healthcare, where it could enhance virtual assistants and chatbots to provide more sophisticated and realistic user experiences.

Additionally, DIRFA could be a valuable tool for individuals with speech or facial disabilities, enabling them to communicate thoughts and emotions through expressive avatars or digital representations.

Assoc. Prof. Lu emphasized the profound impact of the team's research on multimedia communication.

By incorporating AI and machine learning techniques, DIRFA enables the creation of highly realistic videos featuring accurate lip movements, vivid facial expressions, and natural head poses from only audio recordings and static images.

First author Dr. Wu Rongliang, a Ph.D. graduate from NTU's SCSE, highlighted the work's pioneering contribution to AI and machine learning performance, particularly in audio representation learning. The findings were published in the scientific journal Pattern Recognition in August.

