It's what OpenAI claims: https://www.nbcnews.com/tech/tech-news/openai-says-deepseek-may-inapproriately-used-data-rcna189872
Yeah okay, so they're literally just talking about using ChatGPT answers in the training set. Everyone does this; it's completely fair game. Hell, the R1 Qwen that everyone is using now to run 'R1' on their own device is Qwen distilled from R1's outputs (a rough sketch of that step is below).
(Not that I believe in copyright to begin with).
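For anyone wondering what "distilled from R1" means in practice, here's a minimal sketch of the data-collection step: you query the teacher model and save its answers as supervised fine-tuning data for the student. The base_url and model name follow DeepSeek's public docs but should be treated as assumptions, and the prompts are toy examples, not DeepSeek's actual pipeline:

```python
import json
from openai import OpenAI

# DeepSeek exposes an OpenAI-compatible API; base_url and model name
# here follow their public docs, but treat them as assumptions.
teacher = OpenAI(base_url="https://api.deepseek.com", api_key="YOUR_KEY")

# Toy prompt list; a real distillation set would be large and diverse.
prompts = [
    "Explain why the sky is blue.",
    "What is the sum of the integers from 1 to 100?",
]

with open("distill_data.jsonl", "w") as f:
    for p in prompts:
        reply = teacher.chat.completions.create(
            model="deepseek-reasoner",  # R1, per DeepSeek's docs
            messages=[{"role": "user", "content": p}],
        )
        # Each (prompt, teacher answer) pair becomes one supervised
        # fine-tuning example for the student (e.g., a Qwen checkpoint).
        f.write(json.dumps({
            "prompt": p,
            "completion": reply.choices[0].message.content,
        }) + "\n")
```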
Exactly, and it's just stunning hypocrisy on the part of OpenAI, who have been sucking up all data with complete disregard for ownership. I also agree that copyright wouldn't even be necessary in a socialist society, while under capitalism it only ends up being abused.
I would argue that a new socialist country should retain copyright, but drastically shorten its duration (back to the original 14 years, or less). The reason is that copyright is the mechanism that protects copyleft: without copyright, the GPL would no longer be enforceable. I'm no fan of copyright and would love to see it eventually abolished, but it has its uses today.
Yeah, capitalist relations need to be abolished first.
Yeah, the real problem with this is that ChatGPT is going to give a lot of garbage answers when it comes to political questions, and if DeepSeek takes in too much of that, it will also start spitting out very bad takes. It's like copying off a classmate in a test at school, but that classmate gets good grades in some classes and not-so-good grades in others. You need to be careful to only copy on the subjects you know they're actually competent in.
Lol, what's ClosedAI going to do? Hide ChatGPT's answers behind a summary (like with the CoT) 🤣
What is CoT?
Notably, OpenAI's o1 CoT is hidden, with only a summary sometimes available.
From DeepSeek:
Chain of Thought (CoT) in LLMs refers to a prompting technique that guides large language models to articulate intermediate reasoning steps when solving a problem, mimicking human-like logical progression. Here's a concise breakdown:
Purpose: Enhances performance on complex tasks (e.g., math, logic, commonsense reasoning) by breaking problems into sequential steps, reducing errors from direct, unstructured answers.
Mechanism: The model is prompted to produce intermediate reasoning before its final answer, either through few-shot exemplars that include worked-out steps or through a zero-shot cue such as "Let's think step by step."
Benefits: Higher accuracy on multi-step problems and more transparent outputs, since the reasoning is visible and can be checked.
Variants: Few-shot CoT (worked examples in the prompt), zero-shot CoT (a bare reasoning cue), and self-consistency (sampling several chains and taking the majority answer).
Effectiveness: Particularly impactful for tasks requiring structured reasoning, while less critical for simple queries. Research shows marked accuracy gains in benchmarks like math word problems.
In essence, CoT leverages the model's generative capability to externalize reasoning, bridging the gap between opaque model decisions and interpretable human problem-solving.
Example:
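The example itself didn't come through in the paste, so here's a minimal sketch of zero-shot CoT prompting instead, assuming the OpenAI Python SDK; the model name and question are just illustrative:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

question = (
    "Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 balls. How many tennis balls does he have now?"
)

# Zero-shot CoT: appending a reasoning cue pushes the model to
# write out intermediate steps before committing to an answer.
response = client.chat.completions.create(
    model="gpt-4o-mini",  # stand-in; substitute whatever model you're using
    messages=[
        {"role": "user", "content": question + " Let's think step by step."}
    ],
)

print(response.choices[0].message.content)
# A typical CoT answer: "Roger starts with 5 balls. 2 cans of 3 balls
# each add 6 more. 5 + 6 = 11. The answer is 11."
```

Since DeepSeek's API is OpenAI-compatible, the same sketch should also work against their endpoint with a different base_url and model name.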