The text-to-speech and speech-to-text tools are all based on GPT-4o. OpenAI hinted it may take a similar path with video.
Their passive buff is why they're good, though; there are three characters here that you get to play as. Red Knight uses a sword, Blue Knight uses magic, and Green Knight uses a gun. Now ...