this post was submitted on 20 Jun 2024

1000 points (98.9% liked)

Science Memes

14594 readers

796 users here now

Welcome to c/science_memes @ Mander.xyz!

A place for majestic STEMLORD peacocking, as well as memes about the realities of working in a lab.

Rules

Don't throw mud. Behave like an intellectual and remember the human.
Keep it rooted (on topic).
No spam.
Infographics welcome, get schooled.

This is a science community. We use the Dawkins definition of meme.

Research Committee

!spiders@lemmy.world

Other Mander Communities

Science and Research

Biology and Life Sciences

Physical Sciences

Humanities and Social Sciences

Practical and Applied Sciences

Memes

Miscellaneous

founded 2 years ago

MODERATORS

Sal@mander.xyz

fossilesque@mander.xyz

SciBot@mander.xyz

fossilesque@lemmy.dbzer0.com

1000

Elsevier (mander.xyz)

submitted 11 months ago by fossilesque@mander.xyz to c/science_memes@mander.xyz

162 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] Diplomjodler3@lemmy.world 53 points 11 months ago (2 children)

Just print it to a PDF printer.

[–] unexposedhazard@discuss.tchncs.de 45 points 11 months ago

This feels like it should be a browser plugin that automatically anonymizes anything you download.

[–] NeatNit@discuss.tchncs.de 23 points 11 months ago* (last edited 11 months ago) (5 children)

I feel like this will cause quality degradation, like repeatedly re-compressing a jpeg. Relevant xkcd

Edit: though obviously for most use cases it shouldn't matter

[–] Zorsith@lemmy.blahaj.zone 19 points 11 months ago

I feel like it would be negligible degradation for this purpose. Still might not anonymize whomever shares it though, could be watermarked with the same Metadata (https://en.m.wikipedia.org/wiki/Machine_Identification_Code) without being noticeable to the naked eye

[–] Passerby6497@lemmy.world 8 points 11 months ago (1 children)

Why would it cause degradation? You're not recompressing anything, you're taking the visible content and writing it to a new PDF file.

[–] NeatNit@discuss.tchncs.de -2 points 11 months ago (2 children)

You're pushing it through one system that converts a PDF file into printer instructions, and then through another system that converts printer instructions into a PDF file. Each step probably has to make adjustments with the data it's pushing through.

Without looking deeply into the systems involved, I have to assume it's not a lossless process.

[–] TomSelleck@lemm.ee 6 points 11 months ago (2 children)

You should maybe look a bit more into it. How do you think commercial printers or even hobbyists maintain fidelity in their images? Most images pass through multiple programs during the printing process and still maintain the quality. It’s not just copy/paste.

[–] tacosanonymous@lemm.ee 4 points 11 months ago

Magnum PI over here hittin em up with the facts.

[–] NeatNit@discuss.tchncs.de -4 points 11 months ago (2 children)

They maintain a high quality but not lossless.

As a trivial example, if you use the wrong paper size (like Letter instead of A4) then it might crop parts of the page or add borders or resize everything. Again I'll admit, in 99% of cases it doesn't matter, but it might matter if, say, an embedded picture was meant to be exactly to scale.

[–] FellowEnt@sh.itjust.works 5 points 11 months ago

Lossless is the default for print output.

[–] TomSelleck@lemm.ee 3 points 11 months ago (1 children)

My friend, I worked in commercial printing for 2 decades. You’re still making assumptions that are wrong. There are ways to transfer files that are lossless and even ways to improve and upscale artwork. Why do you care so much about this?

[–] NeatNit@discuss.tchncs.de -2 points 11 months ago

"There are ways" ≠ this is what happens by default when done by the average user

[–] 4am@lemm.ee 6 points 11 months ago (1 children)

Those printer instructions are called Postscript and they’re the basis of PDF.

You are thinking that the printing process will rasterize the PDF and then essentially OCR/vector map it back. It’s (usually) not that complicated.

[–] Diplomjodler3@lemmy.world 2 points 11 months ago

Unless of course you print everything and then scan it again, like this guy probably does.

[–] Diplomjodler3@lemmy.world 7 points 11 months ago (1 children)

That's not how PDF works at all.

[–] NeatNit@discuss.tchncs.de -3 points 11 months ago (1 children)

See my reply to another comment

[–] Diplomjodler3@lemmy.world 7 points 11 months ago (1 children)

You're still wrong. the only place where it could cause quality loss if embedded bitmap images are compressed with lower quality settings (which you can adjust). PDF is a vector format, i.e. a mathematical description of what is to be rendered on screen. It was explicitly designed to be scalable, transmittable and rendered on a wide variety of devices without quality loss.

[–] onion@feddit.de 3 points 11 months ago (1 children)

You can ask ChatGPT to spit out the latex code

[–] NeatNit@discuss.tchncs.de 3 points 11 months ago

What

[–] Turun@feddit.de 2 points 11 months ago

I don't understand the "that's no how PDFs work" criticism.

Removing data from the original file is the whole point of the exercise! Of course unique tokens can be hidden in plain sight in images, letter spacing, etc. If we want to make sure to remove that we need to degrade the quality of the PDF so that this information is lost in said lossy conversion.