Did DeepSeek-R1 Train on OpenAI’s Model? Study Finds 74.2% Similarity

…While Microsoft’s Phi-4 Shows 99.3% Independence

by Joan Aimuengheuwa
March 4, 2025
in EnterpriseTECH

Source: Getty Images

A new study by Copyleaks has found a strong similarity between texts generated by DeepSeek-R1 and those produced by OpenAI’s models.

According to the research, 74.2% of DeepSeek-R1’s outputs share stylistic fingerprints with OpenAI’s technology, raising questions about possible reliance on OpenAI’s model during training.

The finding has also prompted discussion of data sourcing, intellectual property rights, and transparency in AI development. If DeepSeek-R1 was trained on OpenAI-generated content without disclosure, it could create legal and ethical risks, including reinforcing biases and limiting diversity in AI-generated text.

The study employed an advanced text attribution method, utilising three independent AI classifiers trained on outputs from OpenAI, Gemini, Claude, and Llama. To ensure accuracy, a classification was only confirmed when all three classifiers reached the same conclusion. This approach resulted in a 99.88% precision rate, with a false-positive rate of just 0.04%.
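The unanimous-agreement rule described above can be illustrated with a minimal Python sketch. It is not Copyleaks’ implementation: the keyword-based classifiers, the label names, and the helper functions `unanimous_vote` and `attribution_rates` are all hypothetical stand-ins, used only to show how a label is accepted when every classifier agrees and how the share of attributed texts is then computed.

```python
from collections import Counter
from typing import Callable, Dict, Optional, Sequence

# Hypothetical classifier type: takes a text, returns a model-family label.
Classifier = Callable[[str], str]


def unanimous_vote(text: str, classifiers: Sequence[Classifier]) -> Optional[str]:
    """Return a label only if every classifier agrees; otherwise None."""
    labels = {clf(text) for clf in classifiers}
    return labels.pop() if len(labels) == 1 else None


def attribution_rates(
    texts: Sequence[str], classifiers: Sequence[Classifier]
) -> Dict[Optional[str], float]:
    """Share of texts attributed to each label; the None key is 'no consensus'."""
    if not texts:
        raise ValueError("texts must not be empty")
    counts = Counter(unanimous_vote(t, classifiers) for t in texts)
    return {label: n / len(texts) for label, n in counts.items()}


def make_keyword_classifier(keyword: str, label: str, fallback: str) -> Classifier:
    """Toy stand-in for a trained classifier (assumption, for illustration only)."""
    return lambda text: label if keyword in text.lower() else fallback


# Three toy classifiers that agree only when the keyword is present.
JURY = [
    make_keyword_classifier("delve", "openai", "llama"),
    make_keyword_classifier("delve", "openai", "gemini"),
    make_keyword_classifier("delve", "openai", "claude"),
]

SAMPLE_TEXTS = [
    "Let us delve into the data.",
    "A plain sentence.",
    "We delve deeper here.",
    "Another plain sentence.",
]

RATES = attribution_rates(SAMPLE_TEXTS, JURY)
print(RATES)
```

Requiring all classifiers to agree trades coverage for precision: a text on which the classifiers disagree is left unattributed rather than assigned to the most common label.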

During testing, DeepSeek-R1’s texts were found to align with OpenAI’s writing style in 74.2% of cases. In contrast, Microsoft’s Phi-4 model exhibited a 99.3% disagreement rate with existing AI-generated texts, suggesting independent training.

Source: Copyleaks

Shai Nisan, Copyleaks’ chief data scientist, commented on the importance of the findings, stating, “With this research, we have moved beyond general AI detection as we knew it and into model-specific attribution, a breakthrough that fundamentally changes how we approach AI content.”

The research team, led by Yehonatan Bitton, Shai Nisan, and Elad Bitton, adopted a rigorous “unanimous jury” approach to ensure the reliability of their findings. Their method goes beyond identifying known AI models and can also detect previously unseen ones by analysing unique stylistic markers.

If DeepSeek-R1 was developed using OpenAI’s work without proper attribution, it could mislead investors and stakeholders about the originality of its technology.

This ultimately raises concerns about AI governance, competitive fairness, and the risk of intellectual property infringement in the industry. Transparency in model training and attribution is essential to maintaining trust and ensuring ethical development practices.
