• k110111@feddit.de
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    The real question is whether or not it is legal. Theoretically it is possible to do with current tech. If i was making such a tool, i would need access to the ebook then pass it through a llm model (possibly with a 7b open source one) to tag which characters are saying what. Once i have tagged dialogues then I could pass it through elevenlabs or other opensource tts and voila you have an audiobook with different voices.

    The real problem is that opensource tts aren’t as good and i imagine if you use paid versions, you will encounter legal issues or it might be too expensive. And can you sell your audio book? Legal troubles again.

    But if you just wanna do it while sailing the high seas, everything should be possible.