Conference Proceedings
- Home
- What’s on TV? 4 editors and 2 robots walk into a bar…
What’s on TV? 4 editors and 2 robots walk into a bar…
Description
Using TV news “chyron” text overlays in the “lower third” (from human editors), image-to-text (OCR), grouping/filtering, and AI gpt to summarize – we social post hourly:
“What’s on TV?”
The non-captions news text (eg: BIDEN VISITS MEXICO) that shows up at the bottom of the screen (like those overhead monitors in airports showing news) is gold, written in real-time by editors during live broadcasts.
However, the data is not carried anywhere inside the video streams (just visually).
What’s a girl with robots to do?
Using CNN, MSNBC, Fox News and BBC News feeds, we use ffmpeg to crop the relevant image area; tesseract to OCR the image into text; and GPT AI to summarize, remove ads, and cleanup the text.
We then post hourly summaries to mastodon.
This talk was presented at Demuxed 2024, a conference by and for engineers working in video. Every year we host a conference with lots of great new talks like this in San Francisco. Learn more at https://demuxed.com
Conference
Speakers
Learning Categories
Other Proceedings
Here are some other proceedings that you might find interesting.
What Codec Should I Use?
Alan Resnick
Doing Server-Side Ad Insertion on Live Sports for 25.3M Concurrent Users
Ashutosh Agrawal
Is now the time to solve the deepfake threat?
Roderick Hodgson
Super Resolution: The scaler of tomorrow, here today!
Nick Chadwick
The do's and don'ts about Streaming security
Javier Brines Garcia
Modeling the conceptual structure of FFmpeg in JavaScript
Ryan Harvey
Objectionable Uses of Objective Quality Metrics
Richard Fliam
RTMP: web video innovation or Web 1.0 hack… how did we get to now?
Sarah Allen
Large-Scale Media Archive Migration to the Cloud
Konstantin Wilms
HEVC Upload Experiments
Chris Ellsworth
Related Courses
Below are some courses that might interest you based on the learning categories and topic tags of this conference proceeding.
What Codec Should I Use?
Alan Resnick
Doing Server-Side Ad Insertion on Live Sports for 25.3M Concurrent Users
Ashutosh Agrawal
Is now the time to solve the deepfake threat?
Roderick Hodgson
Super Resolution: The scaler of tomorrow, here today!
Nick Chadwick
The do's and don'ts about Streaming security
Javier Brines Garcia
Modeling the conceptual structure of FFmpeg in JavaScript
Ryan Harvey
Objectionable Uses of Objective Quality Metrics
Richard Fliam
RTMP: web video innovation or Web 1.0 hack… how did we get to now?