People commonly ask on the Zoom Developer Forum:
Which Zoom API or SDK should I use to develop a bot capable of joining and recording the meeting, and capturing metadata like participant names and transcripts?
Here's how to build a meeting bot that joins meetings and interacts as a participant.
If you're building directly on Zoom, you can use the Zoom Linux SDK, Zoom Windows SDK, or Zoom Mac SDK. We recommend the Linux SDK for the bot use case.
After you pick an SDK, follow these steps:
- Spin up a server. We recommend AWS, GCP, or Digital Ocean.
- Use the Zoom SDK to launch an instance of the Zoom client.
- Once the Zoom SDK is running, use its Raw Data functionality to extract the video and audio streams.
- This returns video as raw I420 frames and audio as raw 16-bit little-endian PCM (PCM 16LE), so you’ll need to encode both yourself afterwards.
- Once one instance is working, you’ll need to scale it across several servers to run multiple bots simultaneously, which is required if you want bots in multiple meetings at the same time.
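To make the encoding step more concrete, here is a minimal Python sketch of what handling the raw output might look like. It assumes you have already written the I420 frames and the PCM 16LE samples to two files, and that you use ffmpeg (not part of the Zoom SDK) to encode them. The file names, resolution, frame rate, sample rate, and channel count are all hypothetical placeholders; use whatever values your SDK callbacks actually report.

```python
from typing import List


def i420_frame_size(width: int, height: int) -> int:
    """Bytes in one I420 frame: a full-resolution Y plane plus
    U and V planes subsampled by 2 in each dimension."""
    chroma_w = (width + 1) // 2
    chroma_h = (height + 1) // 2
    return width * height + 2 * chroma_w * chroma_h


def pcm_s16le_duration(num_bytes: int, sample_rate: int, channels: int) -> float:
    """Seconds of audio in a PCM 16LE buffer (2 bytes per sample per channel)."""
    return num_bytes / (sample_rate * channels * 2)


def build_ffmpeg_command(video_path: str, audio_path: str, output_path: str,
                         width: int, height: int, fps: int,
                         sample_rate: int, channels: int) -> List[str]:
    """Build an ffmpeg command that muxes raw I420 video and raw PCM 16LE
    audio into an H.264/AAC file. Raw streams carry no headers, so every
    format parameter has to be passed explicitly."""
    return [
        "ffmpeg",
        # Raw video input: I420 is ffmpeg's "yuv420p" pixel format.
        "-f", "rawvideo", "-pixel_format", "yuv420p",
        "-video_size", f"{width}x{height}", "-framerate", str(fps),
        "-i", video_path,
        # Raw audio input: signed 16-bit little-endian PCM.
        "-f", "s16le", "-ar", str(sample_rate), "-ac", str(channels),
        "-i", audio_path,
        "-c:v", "libx264", "-c:a", "aac",
        output_path,
    ]


if __name__ == "__main__":
    # Assumed values for illustration only.
    print(i420_frame_size(640, 360))  # 345600
    cmd = build_ffmpeg_command("video.yuv", "audio.pcm", "meeting.mp4",
                               width=640, height=360, fps=25,
                               sample_rate=32000, channels=1)
    print(" ".join(cmd))
```

The frame-size helper is useful for slicing a raw video file back into individual frames, and the command list can be passed to `subprocess.run` once the input files exist.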
Another option is Recall.ai. It’s a simple third-party API that lets you use meeting bots to get raw audio/video from meetings and interact with participants, without spending months building, scaling, and maintaining the bots yourself. It also works across Google Meet, Microsoft Teams, and other meeting platforms.
Zoom Developer Forum Examples
Some examples of this question are: