cross-posted from: https://lemmy.world/post/1246165

Two authors sued OpenAI, accusing the company of violating copyright law. They say OpenAI used their work to train ChatGPT without their consent.

  • _Rho_@lemmy.world
    English
    6 upvotes · edited · 1 year ago

    How can they prove this though? I don’t think they’d have any way to. Unless OpenAI straight up admits it. But like the article mentions, the data could still have been obtained legally.

    • phoneymouse@lemmy.world
      English
      5 upvotes · 1 year ago

      Ask ChatGPT to summarize Sarah Silverman’s book. Ask it to give you a few quotes from it.

      How else would it be able to do that unless it had been trained using the book as an input?

      • RGB3x3@lemmy.world
        English
        4 upvotes · 1 year ago

        It could have parsed it from some webpage it found, like a book review. It doesn’t necessarily have to be from the book itself.

        There are other ways of getting that info than actually ingesting the original material.

      • _Rho_@lemmy.world
        English
        2 upvotes · 1 year ago

        Hmm. That’s a fair point. Lol.

        I suppose it’s possible that it was trained on articles and such that quote/summarize the book. But what you’re saying makes sense.

        • Moskus@lemmy.world
          English
          3 upvotes · 1 year ago

          ChatGPT could have read 1000 other summaries of the book; it doesn’t have to read the actual book to make a summary. It can just rewrite the old ones.