I've wanted to build a video editing agent this year but I kept putting it off because I thought it would take too long. Having simple Claude Skills as markdowns + a little bit of Ruby made this a lot easier than I expected.
Behind the scenes we transcribe audio, analyze frames, and then build a combined/combed transcript of all the footage. Claude can then build rough cuts by just laying out clips with the correct timestamps. After a yaml rough cut is built, a Ruby library generates basic XML for Final Cut or Premiere.
I've wanted to build a video editing agent this year but I kept putting it off because I thought it would take too long. Having simple Claude Skills as markdowns + a little bit of Ruby made this a lot easier than I expected.
Behind the scenes we transcribe audio, analyze frames, and then build a combined/combed transcript of all the footage. Claude can then build rough cuts by just laying out clips with the correct timestamps. After a yaml rough cut is built, a Ruby library generates basic XML for Final Cut or Premiere.