submitted 2 months ago by 5714@lemmy.dbzer0.com to c/ai_@lemmy.world
top 3 comments
[-] CameronDev@programming.dev 2 points 2 months ago

Isn't this a solved problem? Cisco's video conferencing systems already do this: their cameras swivel to point at the active speaker.

[-] venusaur@lemmy.world 1 points 2 months ago

I think this is different. Cisco's audio triangulation: "Audio triangulation - The microphone array behind the fabric panel that is positioned behind the camera pictured above is able to accurately locate voices within the room. The microphones are only used for audio triangulation."

The robot is different because it uses binaural hearing, which relies on the two "microphones" on our head (the ears). If it is done accurately, it should also account for how the sound is shaped on arrival by the pinnae and other structures of the ear.
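
For anyone curious what the two-microphone part looks like in practice, here is a minimal sketch, not the robot's actual method (the thread doesn't say how it works). It assumes a far-field source and two plain microphones, and estimates direction only from the arrival-time difference between them. It ignores the pinna filtering mentioned above entirely, so it cannot tell front from back or up from down. The function names, the 0.18 m spacing, and the 48 kHz sample rate are all made up for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at room temperature


def estimate_delay_samples(left, right, max_lag):
    """Return the lag (in samples) at which `right` best matches `left`.

    A positive result means the sound reached the right microphone later.
    This is a brute-force cross-correlation over lags in [-max_lag, max_lag].
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, l_val in enumerate(left):
            j = i + lag
            if 0 <= j < len(right):
                score += l_val * right[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def azimuth_from_delay(delay_samples, sample_rate, mic_spacing):
    """Far-field angle in degrees; positive means the source is to the left."""
    delay_s = delay_samples / sample_rate
    # Clamp so measurement noise cannot push asin() out of its domain.
    ratio = max(-1.0, min(1.0, delay_s * SPEED_OF_SOUND / mic_spacing))
    return math.degrees(math.asin(ratio))


# Hypothetical test signal: one click that hits the right mic 10 samples late.
left = [0.0] * 100
right = [0.0] * 100
left[50] = 1.0
right[60] = 1.0

lag = estimate_delay_samples(left, right, max_lag=26)
angle = azimuth_from_delay(lag, sample_rate=48000, mic_spacing=0.18)
print(lag, round(angle, 1))
```

The `max_lag` of 26 is chosen because, at the assumed spacing and sample rate, sound cannot physically take more than about 25 samples to travel between the two microphones. A microphone array like the Cisco one can use the same time-difference idea across more microphone pairs, while a true binaural model would add ear-shape (spectral) cues on top.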

[-] venusaur@lemmy.world 1 points 2 months ago

emphasis on the estimating

this post was submitted on 11 Apr 2024
9 points (90.9% liked)

Artificial Intelligence
