Create speaker voiceprints from a few seconds of audio. And, identify individuals in real-time streaming or recorded conversations.
Speaker enrollment api enrolls user for Speaker Identification Api and Realtime Speaker Identification Api.
POST https://proxy.api.deepaffects.com/audio/generic/api/v1/sync/diarization/enroll
curl -X POST "https://proxy.api.deepaffects.com/audio/generic/api/v1/sync/diarization/enroll?apikey=<API_KEY>" -H 'content-type: application/json' -d @data.json
# contents of data.json
{"content": "bytesEncodedAudioString", "sampleRate": 8000, "encoding": "FLAC", "languageCode": "en-US", "speakerId": "user1" }# Sync:
{
"message": "Success"
}For every successfull enrollment the response will containe message as "Success".
Repeat the enrollment with different audios untill the status message changes to "Complete". Then proceed with speaker identification
Enroll a user atleast thrice with 3 different audio, each about 10-12 seconds. The more diverse the enrollment audio files, the better the accuracy for identification.
| Parameter | Type | Description | Notes |
|---|---|---|---|
| encoding | String | Encoding of audio file like MP3, WAV etc. | |
| sampleRate | Number | Sample rate of the audio file. | |
| languageCode | String | Language spoken in the audio file. | [default to 'en-US'] |
| content | String | base64 encoding of the audio file. | |
| speakerId | String | speaker id tobe registered |
| Parameter | Type | Description | Notes |
|---|---|---|---|
| api_key | String | The apikey | Required for authentication inside all requests |
| Parameter | Type | Description | Notes |
|---|---|---|---|
| message | String | Status of enrollment Success or Complete | Success: Current enrollment is successfull, Complete: Enrollment is completed, Repeat the enrollments with different audio samples until Complete message is received |
This API deletes speaker enrollment for the user
POST https://proxy.api.deepaffects.com/audio/generic/api/v1/sync/diarization/delete
curl -X POST "https://proxy.api.deepaffects.com/audio/generic/api/v1/sync/diarization/delete?apikey=<API_KEY>" -H 'content-type: application/json' -d @data.json
# contents of data.json
{"speakerId": "user1"}# The above command returns output:
{
"message": "Success"
}| Parameter | Type | Description | Notes |
|---|---|---|---|
| speakerId | String | speaker id to be registered |
| Parameter | Type | Description | Notes |
|---|---|---|---|
| api_key | String | The apikey | Required for authentication inside all requests |
| Parameter | Type | Description | Notes |
|---|---|---|---|
| message | String | Request status | Success or Failure |
This API lists all the enrolled speakers enrolled for a developer along with enrollment status
GET https://proxy.api.deepaffects.com/audio/generic/api/v1/sync/diarization/get_enrolled_speakers
curl -X GET "https://proxy.api.deepaffects.com/audio/generic/api/v1/sync/diarization/get_enrolled_speakers?apikey=<API_KEY>"
```shell
# The above command returns output:
{
"developer_id": "testuser",
"enrolled_speaker_ids": [
{
"speaker_id": "speaker_1",
"enrollment_complete" "True"
}
]
}| Parameter | Type | Description | Notes |
|---|---|---|---|
| apikey | String | The apikey | Required for authentication inside all requests |
DeepAffects is a speech analysis platform for Developers. We offer a number of speech analysis apis like, Speech Enhancement, Multi-Speaker Diarization, Emotion Recognition, Voice-prints, Conversation Metrics etc. For more information, checkout our developer portal