Labellerr’s audio analyser helps an AI scientist obtain an accurate transcription of an audio recording, along with details such as the essence of the conversation and the next steps that follow from it. The Gemini API, configured to send its responses to the platform’s UI, returns its analysis of the data that has been integrated with Labellerr. (The user has to enter the right question in the prompt window.) Below is a sample audio file analysed by Labellerr’s Audio analyser.
The template configuration setup is located in the ‘Label Configuration’ section on the project creation page.
How to Create an Audio Project?
After setting up classification questions and creating the project, you will be directed to the dashboard. Click on the ‘Label’ tab to open the labeling screen.
Audio Labelling Screen
Next, click ‘LLM Powered Transcript.’ A small popup will appear where you can enter prompts or questions, such as how many people are in the audio, full transcription, highlighted keywords, summary of the audio, and more.
LLM Powered Transcript Prompt
Click “Proceed” after entering your prompt. The generative AI will then return a transcription or other details, depending on what your prompt asked for.
Generative AI Transcription Output
You can review and edit this output as needed.
Further Prompts: You can provide additional prompts to obtain more specific details about the audio, such as the transcription of a particular person (male/female/first person, etc.).
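If you reuse the same kinds of prompts across many files, it can help to keep them as templates and paste the result into the prompt window. The sketch below is only an illustration of that idea: the template names, the wording, and the `build_prompt` helper are hypothetical and are not part of Labellerr or the Gemini API.

```python
# Hypothetical prompt templates; names and wording are illustrative only.
PROMPT_TEMPLATES = {
    "speaker_count": "How many people are speaking in the audio?",
    "transcription": "Provide a full transcription of the audio.",
    "keywords": "List the highlighted keywords from the audio.",
    "summary": "Summarise the audio in a few sentences.",
}


def build_prompt(task, speaker=None):
    """Return the prompt text for a task, optionally narrowed to one speaker."""
    if task not in PROMPT_TEMPLATES:
        raise ValueError(f"Unknown task: {task!r}")
    prompt = PROMPT_TEMPLATES[task]
    if speaker:
        # Follow-up prompts can target a particular person in the recording.
        prompt += f" Restrict the answer to the {speaker} speaker."
    return prompt


if __name__ == "__main__":
    # Text to paste into the 'LLM Powered Transcript' prompt window.
    print(build_prompt("transcription", speaker="first"))
```

Keeping the wording in one place makes follow-up prompts consistent from file to file; adjust the templates to whatever phrasing gives the best results for your audio.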
Once satisfied with the description/transcription, click the “Submit” button to submit the file. The file will be sent to a reviewer, who can also view the audio file and edit the output if necessary. Reviewers can access files for review by navigating to the “Review” page from the dashboard.