• ebu@awful.systems
    13 upvotes · 1 year ago

    f4mi’s channel is fantastic btw, fun little deep dives on old hardware and games. highly recommend checking her stuff out

  • Pennomi@lemmy.world
    10 upvotes · 1 year ago

    Works on some, but a lot of AI stuff runs its own speech-to-text pass to generate the transcript instead of trusting the provided subtitles.

    • Sailor Sega Saturn@awful.systems
      15 upvotes · 1 year ago

      The video mentions this as well as other practical limitations (like OOMing the YouTube phone app lol).

      Really there are fairly straightforward technical ways around these techniques – out of bounds or invisible subtitles can be cropped, or individual letters can be formed into paragraphs the same way PDF readers do; but it’s still funny that it works at all and involves the word ass.
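
      A rough sketch of what that workaround could look like, assuming the subtitle events have already been parsed into a hypothetical `Cue` record with a position and an opacity. The names `Cue`, `is_visible`, and `rebuild_lines` are made up for illustration; this is not a real ASS parser or any actual scraper's code:

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """Hypothetical, already-parsed subtitle event (not a real ASS structure)."""
    start: float    # display time in seconds
    text: str       # may be a single letter
    x: float        # horizontal position in video pixels
    y: float        # vertical position in video pixels
    opacity: float  # 0.0 = invisible, 1.0 = fully opaque


def is_visible(cue: Cue, width: float, height: float) -> bool:
    """Keep only cues that are drawn inside the frame and not transparent."""
    return cue.opacity > 0 and 0 <= cue.x <= width and 0 <= cue.y <= height


def rebuild_lines(cues: list[Cue], width: float, height: float,
                  y_tol: float = 2.0) -> list[str]:
    """Drop hidden cues, then join the rest into lines, PDF-reader style.

    Cues with the same start time and roughly the same y (within y_tol
    buckets) are treated as one line and concatenated left to right.
    """
    kept = [c for c in cues if is_visible(c, width, height)]
    kept.sort(key=lambda c: (c.start, round(c.y / y_tol), c.x))

    lines: list[str] = []
    current_key = None
    for cue in kept:
        key = (cue.start, round(cue.y / y_tol))
        if key != current_key:
            lines.append("")
            current_key = key
        lines[-1] += cue.text
    return lines
```

      The bucketing by `y_tol` is deliberately crude; a real tool would also have to cope with overlapping timings, rotation, and whatever else the subtitle author throws at it.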

      It comes on the coattails of a long history of AI companies not caring at all about security, privacy, data integrity, or being nice people.

  • MichaelMuse@programming.dev
    1 upvote · 7 months ago

    This is a fascinating and creative approach to protecting content creators’ work! Using Cyrillic characters to create ‘.аss’ subtitle files that confuse AI scrapers is quite clever.

    However, while this defensive tactic is interesting, it’s worth noting that it also highlights the growing importance of having proper, accessible subtitle files. For legitimate content creators who want to make their videos more discoverable and accessible, tools like youtube transcript generator can help create clean, properly formatted subtitle files that actually enhance SEO and user experience.

    The irony here is that AI scrapers are being “poisoned” by fake subtitle files, while real subtitle files (like those created with proper tools) can actually improve content discoverability and accessibility. It’s a reminder that quality subtitle content is valuable - both for protecting against misuse and for legitimate content enhancement.

    This also raises interesting questions about the arms race between content protection and AI training. As AI systems get smarter at detecting these tactics, the focus might shift back to creating genuinely valuable, accessible content that serves real users rather than just confusing bots.