• utopiah@lemmy.ml
    link
    fedilink
    arrow-up
    47
    ·
    edit-2
    11 days ago

    FWIW if you are interested in such tooling consider also soffice and pandoc which have (as far as I can tell) similar features but have been existing for years now and are not related to Microsoft.

    Edit: not related to Microsoft AND Google, seems the transcription aspect (which IMHO is still weird in that context but OK) is done via Google servers, cf https://lemmy.ml/post/23629310/15586865

    • haverholm@kbin.earth
      link
      fedilink
      arrow-up
      7
      ·
      12 days ago

      The single exception to this (which is actually buried fairly deep in the feature list) is the audio transcription tool. I didn’t take a closer look at what is used to perform this, but at least it’s not “just” document conversion like pandoc.

      • utopiah@lemmy.ml
        link
        fedilink
        arrow-up
        5
        ·
        12 days ago

        audio transcription tool

        Thanks for the clarification but I’m a bit confused here, like audio transcription, STT, done by e.g. Whisper? If so what’s the use case? When I think of Office documents audio transcription is not something I have in mind.

    • charles@lemmy.ca
      link
      fedilink
      arrow-up
      1
      ·
      8 days ago

      FYI the link in your comment got cut off before the last bracket so it’s not linking to the wiki page directly.

      • davel@lemmy.ml
        link
        fedilink
        English
        arrow-up
        2
        ·
        8 days ago

        Fixed, thanks. Though it’s 4 days later, so I’m not sure it will help anyone 🤷

  • loathsome dongeater@lemmygrad.ml
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    1
    ·
    12 days ago

    This could be useful to me. A while ago I was trying to make something that take all unread posts from my feed reader, make an epub out of them and then put it behind an OPDS server.

    I found converting HTML from RSS to first markdown and then compiling them to an epub the most reliable way to take out the unnecessary markup from the source HTML. I used pandoc for this.

    • utopiah@lemmy.ml
      link
      fedilink
      arrow-up
      4
      ·
      12 days ago

      I used pandoc for this.

      Please come back and share if it’s done better or worst and if so along which dimensions. Quite curious to better understand the differences.