How do you encode your paper scans?
  • kyle

    You could also try adjusting the contrast a bit. I use an app called Genius Scan, which increases the contrast of the scanned image to reduce the number of bits needed per pixel. This shrinks the file quite a bit, although the result obviously isn't a true representation of the scanned document. The TextCleaner ImageMagick plugin looks like it's doing something similar.

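To make the contrast idea in the comment above concrete, here is a rough sketch that uses only the Python standard library. It is not how Genius Scan or TextCleaner actually work. The "scan" is synthetic noise, `zlib` stands in for whatever compression a real scanner app uses, and the helper names (`synthetic_scan`, `threshold`, `compressed_size`) are made up for illustration.

```python
import random
import zlib


def synthetic_scan(width=200, height=200, seed=0):
    """Build a fake grayscale page: light noisy paper with dark noisy strokes."""
    rng = random.Random(seed)
    pixels = bytearray()
    for y in range(height):
        for x in range(width):
            is_ink = (y % 20 < 6) and (x % 7 < 4)  # crude stand-in for text
            base = 40 if is_ink else 215
            pixels.append(base + rng.randint(-25, 25))  # sensor/paper noise
    return bytes(pixels)


def threshold(pixels, cutoff=128):
    """Push every pixel to pure black or pure white (maximum contrast)."""
    return bytes(0 if p < cutoff else 255 for p in pixels)


def compressed_size(pixels):
    """Size in bytes after lossless compression at the highest zlib level."""
    return len(zlib.compress(pixels, 9))


scan = synthetic_scan()
high_contrast = threshold(scan)
print("original:     ", compressed_size(scan), "bytes")
print("high contrast:", compressed_size(high_contrast), "bytes")
```

The thresholded page compresses far better because the noise in the paper and ink is gone, which is also why the result is no longer a faithful copy of the original.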
  • ChatGPT4 seems to be having a bad day
  • kyle

    Ah, I only use the OpenAI API. I haven’t really explored the rest of the providers out there yet. Claude looks interesting though!

  • How do you encode your paper scans?
  • kyle

    I’ve never used Paperless, but I just checked it out and it looks pretty neat. My first thought would be to scan documents at a higher resolution, let the OCR happen, then convert the file to a JPEG or something smaller once the text has been extracted.

    I spent a few minutes looking at their wiki and it looks like it might be possible.

    Like I said though, no experience with this software so I’m not sure that’d actually work.

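As a standalone illustration of the "OCR first, shrink afterwards" idea in the comment above, here is a small sketch that builds the two commands with the `tesseract` and ImageMagick (`magick`) command-line tools. It has nothing to do with how Paperless handles files internally, and whether Paperless can be hooked up this way is untested. The file names, the 50% resize, the JPEG quality of 60, and the helper names `build_commands` and `archive` are all assumptions picked for the example.

```python
import shutil
import subprocess
from pathlib import Path


def build_commands(scan: Path, jpeg_quality: int = 60, shrink: str = "50%"):
    """Return two commands: OCR the full-resolution scan, then shrink it."""
    text_base = scan.with_suffix("")  # tesseract appends ".txt" itself
    small = scan.with_name(scan.stem + "_small.jpg")
    ocr_cmd = ["tesseract", str(scan), str(text_base)]
    shrink_cmd = ["magick", str(scan), "-resize", shrink,
                  "-quality", str(jpeg_quality), str(small)]
    return ocr_cmd, shrink_cmd


def archive(scan: Path):
    """Run OCR before shrinking so the text comes from the sharp original."""
    for cmd in build_commands(scan):
        if shutil.which(cmd[0]) is None:
            raise RuntimeError(f"{cmd[0]} is not installed")
        subprocess.run(cmd, check=True)


# Only print the commands here; call archive(Path("scan.png")) to run them.
ocr_cmd, shrink_cmd = build_commands(Path("scan.png"))
print(" ".join(ocr_cmd))
print(" ".join(shrink_cmd))
```

The order is the whole point: the OCR step sees the high-resolution image, and only the copy kept for storage gets downsized.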
  • Gotta hand it to the guys over at [risky.biz](https://risky.biz/), it seems like they are producing so much great content that I can't get enough of it. I really enjoy their stuff because it's not just a bunch of news headlines with little context; they'll actually go into in-depth conversations and talk about the implications of a current event or headline. Are there any other podcasts I should be checking out?


    kyle

    infosec.pub