Use Microsoft Teams Intelligent Speakers to identify in-room participants in a meeting transcription
If your organization's Microsoft Teams Rooms are equipped with Intelligent Speakers, you can hold meetings where in-room participants can be identified in live transcription.
During the meeting, all participants can easily see who’s saying what, and the post-meeting transcript identifies both remote and in-room attendees, except any who choose not to be identified. Without speaker recognition, audio will be attributed to the room in AI notes.
In this article
Set up your digital voice profile
Update or remove your voice profile
Room experience with and without speaker recognition
Supported regions and languages
Set up your digital voice profile
To set up your voice profile, you can use the desktop version of Teams on a Windows or Mac device.
Note: Once you've set up your voice profile, you can still participate in meetings where a different language is spoken.
Before creating a voice profile, make sure your language is supported in Teams.
To create a voice profile:
-
In Teams, select Settings and more > Settings > Recognition.
-
Select Create voice profile.
-
Make sure your mic is selected from the device dropdown menu.
-
Select Start voice capture and read the text.
Tip: Record your voice profile in a quiet location, using a high-quality microphone for best results.
-
Select End voice capture when you're done.
-
You can start face recognition setup right after you're done creating your voice profile, or select Close.
Room experience with and without speaker recognition
With speaker recognition set up, individuals in the conference room will each be attributed by name in AI notes.
Without speaker recognition set up, audio will be attributed to the room in AI notes.
Update or remove your voice profile
You can re-record your voice profile if the Intelligent Speaker is having difficulty recognizing your voice.
If you remove your voice profile, your speech won't be identified in future meetings.
-
In Teams, select Settings and more > Settings > Recognition.
-
Under Voice, select Update to re-record your voice or select Remove.
Disable speaker recognition during the meeting
You can disable speaker recognition during the meeting for everyone using the Microsoft Teams Room console.
After you've joined the meeting, select More options at the bottom of the console, and select Turn off voice identification.
Supported regions and languages
Intelligent Speaker is available in all countries and regions.
The language of the Teams app you've installed determines the voice enrollment languages. These are the localized versions that are available:
Language |
Country/Region |
Culture ID |
---|---|---|
Arabic |
Saudi Arabia |
ar-sa |
Chinese (Simplified) |
China |
zh-cn |
Chinese |
Taiwan |
zh-tw |
Danish |
Denmark |
da-dk |
Dutch |
Netherlands |
nl-nl |
English |
Australia |
en-au |
English |
Canada |
en-ca |
English |
India |
en-in |
English |
New Zealand |
en-nz |
English |
United Kingdom |
en-gb |
English |
United States |
en-us |
Finnish |
Finland |
fi-fi |
French |
Canada |
fr-ca |
French |
France |
fr-fr |
German |
Germany |
de-de |
Italian |
Italy |
it-it |
Japanese |
Japan |
ja-jp |
Norwegian |
Norway |
nb-no |
Polish |
Poland |
pl-pl |
Portuguese |
Brazil |
pt-br |
Russian |
Russia |
ru-ru |
Spanish |
Spain |
es-es |
Spanish |
Mexico |
es-mx |
Swedish |
Sweden |
sv-se |
Note: We are actively expanding our language support across different regions. If your preferred Teams language is not listed in the table above, we will soon automatically switch you to the closest available language based on linguistic and regional similarities. Meanwhile, please manually choose a language from the currently supported list.
At Microsoft, we take your privacy and security extremely seriously. Our commitment is to ensure that your data is handled with the highest standards of protection and transparency.
You have ultimate control: You have the authority to decide how your data is used. You must first enroll and opt-in to provide your data. Your consent is required for any use of your data. Please review the overview of face and voice enrollment if you have any questions regarding voice profile data usage and storage.
Informed consent: You provide explicit consent for your data to be used, ensuring you are always in control of your personal information.
Opt-out any time and data deletion: You have the option to withdraw your consent and stop the use of your data at any time. If you choose to unenroll, your data will be deleted, ensuring it is no longer stored or used.
Where is the data stored? Your data is stored in the same region as your Microsoft Teams data, ensuring adherence to regional data sovereignty and privacy regulations.
Secure storage in Microsoft Cloud: The data, such as voice signatures for meeting rooms feature, is stored securely within your organization’s tenant in the Microsoft Cloud and locally on the user's device for Voice Isolation. This data is managed in accordance with Microsoft's stringent data protection standards.
Data access: Access to data is highly regulated within Microsoft, supported by stringent security protocols that ensure privacy and prevent this data from being shared with third parties.
Why aren't I being identified? I've set up my voice profile and my speech is clearly transcribed.
After the meeting, try updating your voice profile.
What are the requirements for inviting meeting attendees to use Intelligent Speakers?
Each meeting attendee must be invited individually, on the original invite or through a forwarded invitation.
What is my voice profile used for?
Your voice profile is only used in ways you've already consented to. Microsoft won't use your voice profile without your permission.
How many people can be on the invitation list?
There is no limitation on the number of people in the invitation list. However, voice identification is only available in meetings with up to 20 people with enrolled voices.
When will my voice profile be deleted?
Your voice profile will be deleted after one year of no use.
What should I do if I can't access certain features?
If you can't access certain features, please contact your IT admin. To learn more, see Manage voice recognition technology controls for an Intelligent Speaker
What's the recommended room size for Intelligent Speakers?
Intelligent Speakers work best in medium-sized rooms that hold 8–10 people.
Want to know more? Please review the face and voice enrollment document if you have any questions regarding voice profile data usage and storage.