Introduction
Post-call transcription and summary is a powerful feature provided by VideoSDK that allows users to generate detailed transcriptions and summaries of recorded meetings after they have concluded. This feature is particularly beneficial for capturing and documenting important information discussed during meetings, ensuring that nothing is missed and that there is a comprehensive record of the conversation.
How Post-Call Transcription Works?
Post-call transcription involves processing the recorded audio or video content of a meeting to produce a textual representation of the conversation. Here’s a step-by-step breakdown of how it works:
- Recording the Meeting: During the meeting, the audio and video are recorded. This can include everything that was said and any shared content, such as presentations or screen shares.
- Uploading the Recording: Once the meeting is over, the recorded file is uploaded to the VideoSDK platform. This can be done automatically or manually, depending on the configuration.
- Transcription Processing: The uploaded recording is then processed by VideoSDK’s transcription engine. This engine uses advanced speech recognition technology to convert spoken words into written text.
- Retrieving the Transcription: After the transcription process is complete, the textual representation of the meeting is made available. This text can be accessed via the VideoSDK API and used in various applications.
Benefits of Post-Call Transcription
- Accurate Documentation: Provides a precise record of what was discussed, which is invaluable for meeting minutes, legal documentation, and reference.
- Enhanced Accessibility: Makes content accessible to those who may have missed the meeting or have hearing impairments.
- Easy Review and Analysis: Enables quick review of key points and decisions made during the meeting without having to re-watch the entire recording.
Let's Get started
VideoSDK empowers you to seamlessly integrate the video calling feature into your React application within minutes.
In this quickstart, you'll explore the group calling feature of VideoSDK. Follow the step-by-step guide to integrate it within your application.
Prerequisites
Before proceeding, ensure that your development environment meets the following requirements:
- VideoSDK Developer Account (Not having one, follow VideoSDK Dashboard)
- Basic understanding of JavaScript.
- JavaScript VideoSDK
- Have Node and NPM installed on your device.
- Generate a token from the VideoSDK dashboard
Getting Started with the Code!
Follow the steps to create the environment necessary to add video calls to your app. You can also find the code sample for quickstart here.
First, create one empty project using mkdir folder_name
on your preferred location.
Install Video SDK
Import VideoSDK using the <script>
tag or Install it using the following npm command. Make sure you are in your app directory before you run this command.
<html>
<head>
<!--.....-->
</head>
<body>
<!--.....-->
<script src="https://sdk.videosdk.live/js-sdk/0.0.89/videosdk.js"></script>
</body>
</html>
Structure of the project
Your project structure should look like this.
root
├── index.html
├── config.js
├── index.js
You will be working on the following files:
- index.html: Responsible for creating a basic UI.
- config.js: Responsible for storing the token.
- index.js: Responsible for rendering the meeting view and the join meeting functionality.
Step 1: Design the user interface (UI)
Create an HTML file containing the screens, join-screen
and grid-screen
.
Output
Step 2: Implement Join Screen
Configure the token in the config.js
file, which you can obtain from the VideoSDK Dashbord.
Next, retrieve all the elements from the DOM and declare the following variables in the index.js
file. Then, add an event listener to the join and create meeting buttons.
Step 3: Initialize the meeting
Following that, initialize the meeting using the initMeeting()
function and proceed to join the meeting.
Output
Step 4: Create the Media Elements
In this step, Create a function to generate audio and video elements for displaying both local and remote participants. Set the corresponding media track based on whether it's a video or audio stream.
Step 5: Handle participant events
Thereafter, implement the events related to the participants and the stream.
The following are the events to be executed in this step:
participant-joined
: When a remote participant joins, this event will trigger. In the event callback, create video and audio elements previously defined for rendering their video and audio streams.participant-left
: When a remote participant leaves, this event will trigger. In the event callback, remove the corresponding video and audio elements.stream-enabled
: This event manages the media track of a specific participant by associating it with the appropriate video or audio element.
Output
Step 6: Implement Controls
Next, implement the meeting controls such as toggleMic, toggleWebcam and leave the meeting.
Step 7: Configuring Transcription
- In this step, we set up the configuration for post-transcription and summary generation. We define the webhook URL where the webhooks will be received.
- In the
startRecording
function, we have passed the transcription object and the webhook URL, which will initiate the post-call transcription process. - Finally, when we call the
stopRecording
function, both the post-call transcription and the recording will be stopped.
Run your code
Once you have completed all the steps mentioned above, run your application using the code block below.
live-server --port=8000
Final Output
You have completed the implementation of a customized video calling app in Javascript using VideoSDK. To explore more features, go through Basic and Advanced features.
Fetching the Transcription from the Dashboard
Once the transcription is ready, you can fetch it from the VideoSDK dashboard. The dashboard provides a user-friendly interface where you can view, download, and manage your Transcriptions & Summary.
Conclusion
Integrating post-call transcription and summary features into your React application using VideoSDK provides significant advantages for capturing and documenting meeting content. This guide has meticulously detailed the steps required to set up and implement these features, ensuring that every conversation during a meeting is accurately transcribed and easily accessible for future reference.