#trainingdata — Bluesky Posts

@calculito.bsky.social

1 day ago

(Who is) The Boss Thesis: AI’s visual defaults reveal not a masculine or feminine bias, but something stranger — a world built by male attention, then shaped…

AI was trained on the output of human civilization — and by authorship, that civilization was mostly male. The outputs center women. That contradiction is not a bug someone introduced.
medium.com/p/62558fbb15fe
#ArtificialIntelligence #TrainingData #DesignDefaults #RepresentationBias #AIEthics

0 0 0 0

mikronews

@calculito.bsky.social

1 day ago

(Who is) The Boss Thesis: AI’s visual defaults reveal not a masculine or feminine bias, but something stranger — a world built by male attention, then shaped…

I just published (Who is) The Boss medium.com/p/who-is-the...
#ArtificialIntelligence #TrainingData #DesignDefaults #RepresentationBias #AIEthics

1 0 0 0

mikronews

@calculito.bsky.social

1 day ago

(Who is) The Boss Thesis: AI’s visual defaults reveal not a masculine or feminine bias, but something stranger — a world built by male attention, then shaped by male desire, producing outputs that center women as subje...

Check out my latest article: (Who is) The Boss www.linkedin.com/pulse/who-th... via www.linkedin.com/in/ionoaie/

#ArtificialIntelligence #TrainingData #DesignDefaults #RepresentationBias #AIEthics

1 1 0 0

the5krunner

@the5krunner.bsky.social

3 days ago

Performance Condition | the5krunner Performance Condition Performance Condition is a real-time metric that shows how the body is performing relative to its established baseline during a run

Your Garmin shows a number mid-run that most athletes ignore — or misread. Performance Condition explained in full: what it measures, what distorts it, and when to act on it.
the5krunner.com/garmin-featu...
#Running #Garmin #TrainingData

1 0 0 0

Katherine Borges

@dnakath.bsky.social

2 weeks ago

Free or not to free Have you ever heard the saying, "If something is free, you're the product"? You innately know that's true. That's why Facebook has ads and l...

My latest AI blog post - this one is on training data and bots. Pretty funny captcha by ChatGPT! #AI #DNA #ArtificialIntelligence #CRAIGEN #privacy #trainingdata #genealogy gptfamilytree.blogspot.com/2026/02/free...

2 0 1 0

the5krunner

@the5krunner.bsky.social

3 weeks ago

HRV Data: What 5 Years of Daily Tracking Showed Me Five years of daily heart rate variability measurements reveal patterns about sustainable training, measurement protocols, and what metrics actually matter. Analysis includes position changes, strengt...

5 years of daily HRV tracking showed stability until I changed measurement position in Feb 2025. Metrics shifted immediately—but kept drifting for months. Position artifact or genuine adaptation? the5krunner.com/2026/02/15/h... #running #HRV4Training #heartratevariability #trainingdata

0 0 0 0

iMerit

@imerit.bsky.social

1 month ago

We’re heading to the India AI Impact Summit 2026.
Meet the iMerit team at Booth 1.45, Bharat Mandapam, New Delhi, to discuss data quality, model evaluation, and human-in-the-loop AI for real-world deployment.

#AIImpactSummit #ModelEvaluation #TrainingData

0 0 0 0

gtbarry

@gtbarry.bsky.social

1 month ago

Even Starlink Wants Your Data for AI Model Training. How to Opt Out SpaceX uses your data to train its machine learning and AI models and might share that with partners who 'help us develop AI-enabled tools that improve your customer experience.'

SpaceX has updated its Starlink privacy policy to say customer data can be used to train AI models, and subscribers are opted in by default

Also, the company might share personal information with third parties

#privacy #artificialintelligence #AI #trainingdata #spacex #starlnk #bigdata #data #tech

2 0 0 0

Bob Carver

@cybersecboardrm.bsky.social

1 month ago

Cybersecurity Experts Warn That Most Gmail Users Don’t Realize This AI Setting Is Already Turned On
Inbox www.inc.com/leila-sh... #cybersecurity #privacy #Gmail #AI #trainingdata

0 0 0 0

Association of Research Libraries

@arl.org

2 months ago

FREE UKSG webinar: The Open Access–AI Conundrum: does free to read mean free to train? - UKSG This is a fantastic opportunity to listen to expert speakers with no travelling required. This is a free webinar - Please note that advance registration is required. This webinar will be recorded and ...

Upcoming UKSG Webinar (Thu Feb 5): The Open Access – #AI Conundrum: Does Free to Read Mean Free to Train? www.uksg.org/events/free-... #LLMs #oa #trainingdata #scholcomm #publishing
@uksg.bsky.social

4 1 0 0

infoDOCKET

@infodocket.bsky.social

2 months ago

Upcoming UKSG Webinar (Feb. 5, 2026): The Open Access – #AI Conundrum: Does Free to Read Mean Free to Train? www.uksg.org/events/free-... #LLMs #oa #trainingdata #scholcomm #publishing @uksg.bsky.social

3 2 0 0

Gerrit Eicker

@eicker.bsky.social

2 months ago

#LLMs like #OpenAI’s #GPT and #Google’s #Gemini #store portions of their #trainingdata, contradicting claims that they only learn #patterns. This “#memorisation” poses #legalrisks for AI companies, potentially leading to #copyrightinfringement lawsuits. The phenomenon also challenges the industry’s…

0 0 0 0

the5krunner

@the5krunner.bsky.social

2 months ago

Strava Files for IPO: Everything you need to know. Strava has reportedly filed for a 2026 IPO with suggestions of a target valuation of $3B. From the inevitable rollout of feed ads to new AI-coaching features, we analyze how going public will fundamen...

Strava files for IPO at $3B valuation! What it means for your training data, ads, privacy & more. Deep dive: the5krunner.com/2026/01/09/s... #Strava #IPO #FitnessTech #Running #Cycling #TrainingData #SportsTech #Athletes

0 1 0 0

Gerrit Eicker

@eicker.bsky.social

2 months ago

#Anthropic is challenging the prevailing belief in Silicon Valley that scaling up #compute and #infrastructure is the only path to success. Instead, Anthropic is focussing on #algorithmicefficiency, #smarterdeployment, and higher quality #trainingdata to achieve powerful models with less resources.…

0 0 1 0

Fyx

@fyx.jp

2 months ago

The Training Data Apocalypse What happens when official sources become disinformation? Future AI won't be able to tell you why something is wrong.

fyx.jp/l/en-US/diar... #AIFuture #Misinformation #TrainingData

0 0 0 0

James German, Hero-Maker

@jamesgerman.bsky.social

3 months ago

#TrainingData

(free on my substack)

open.substack.com/pub/lordstre...

#AI
#MDGP

0 0 0 0

Nicky/Nicole (she/her) 🦋🐛🌱🌿🐈‍⬛🐈

@nhallerwilson.bsky.social

3 months ago

OUR #paintings, OUR #words, OUR #music, OUR #faces, and OUR #voices - ARE NOT THEIR TRADE SECRETS !

12/8/2025, Stanford U., 10 am, hearing for #AB-412 #Generative #artificialintelligence: #trainingdata: #copyrighted materials

leginfo.legislature.ca.gov/faces/billNa...

2 0 1 0

Ars Technica News

@arstechni.ca

3 months ago

OpenAI desperate to avoid explaining why it deleted pirated book datasets https://arstechni.ca #copyrightinfringement #piratingbooks #onlinepiracy #trainingdata #ChatGPT #Policy #openai #AI

0 0 0 0

Tobias Zeumer

@vform.openbiblio.social.ap.brid.gy

4 months ago

Original post on openbiblio.social

Libraries open their archives to train AI chatbots with books spanning centuries of human knowledge | Milwaukee Independent

www.milwaukeeindependent.com/newswire/libraries-open-...

"Now that such text is of use as […]

0 3 0 0

Chris Lombardi

@chrisblue.bsky.social

4 months ago

Meta Says Porn Stash was for 'Personal Use,' Not Training AI Models Will they unmask the gooners?

br00t4c@mastodon.social - Meta Says Porn Stash was for 'Personal Use,' Not Training AI Models

#TrainingData #ContentModeration #TechIndustry #PrivacyConcerns #UnmaskGooners

gizmodo.com/meta-says-po...

0 0 0 0

PyrexCorex

@pyrexcorex.bsky.social

5 months ago

#trainingdata

0 0 0 0

Muhammad Ali

@muhammadalighugh.bsky.social

5 months ago

"Training data is the silent hero behind any AI — its quality, diversity, and balance define your model’s limits. In 2025, dataset sizes double every eight months, pushing us toward synthetic data techniques — but beware “model collapse” risks arise when models train on their own outputs. How confident are you in your training data’s foundation? #AI #TrainingData #MachineLearning #DataQuality

Training data is the silent hero behind any AI its quality,diversity,and balance define your model’s limits.In 2025,dataset sizes double every eight months,pushing us toward synthetic data techniques but beware “model collapse” risks arise when models train on their own outputs.
#AI #TrainingData

2 1 0 0

Ars Technica News

@arstechni.ca

5 months ago

AI models can acquire backdoors from surprisingly few malicious documents https://arstechni.ca #UKAISecurityInstitute #alanturinginstitute #AIvulnerabilities #backdoorattacks #machinelearning #datapoisoning #trainingdata #LLMsecurity #modelsafety #pretraining #AIresearch #AIsecurity…

1 0 0 0

Coach Pāṇini ®

@paninid.mastodon.world.ap.brid.gy

5 months ago

Ulrike Franke (@rikefranke@bagarrosphere.fr) “Almost all of the AI models showed a preference to escalate aggressively, use firepower indiscriminately and turn crises into shooting wars — even to the point of launching nuclear weapons.” “It’s almost like the AI understands escalation, but not de-escalation. We don’t really know why that is.” https://www.politico.com/news/magazine/2025/09/02/pentagon-ai-nuclear-war-00496884

Maybe because it’s in the #TrainingData
https://bagarrosphere.fr/@rikefranke/115155357196579448

0 1 0 0

HitechBPO

@hitechbpo.bsky.social

5 months ago

LLM Training Data Services for Fine-Tuning & RLHF Boost your AI development with LLM training data services tailored for fine-tuning, RLHF, annotation, and RAG. Get high-quality, domain-specific datasets.

Fuel Your LLM with High-Quality Training Data

Scale smarter. Train faster. Perform better.

Learn more: shorturl.at/BJZIA

#LLM #DataServices #Data #MachineLearning #GenerativeAI #TrainingData #DataAnnotation #LanguageModel #NLP

1 0 0 0

Habiledata

@habiledata.bsky.social

5 months ago

LLM Training Data Services for Fine-Tuning & RLHF Boost your AI development with LLM training data services tailored for fine-tuning, RLHF, annotation, and RAG. Get high-quality, domain-specific datasets.

Fuel Your LLM with High-Quality Training Data

Scale smarter. Train faster. Perform better.

Learn more: shorturl.at/BJZIA

#LLM #DataServices #Data #MachineLearning #GenerativeAI #TrainingData #DataAnnotation #LanguageModel #NLP

1 0 0 0

Rayan Potter

@anolytics.bsky.social

5 months ago

Why Choose Professional Medical Data Annotation Services? Choose Anolytics for expert-reviewed medical data annotation services that power accurate AI, ML, and computer vision models in healthcare.

Anolytics offers customizable and exceptional annotation help targeted to project prerequisites. If you're an existing #AI #medical company looking to automate processes or a data scientist needing #trainingdata for building diagnostic tools, do not hesitate to reach out.

2 0 0 0

beSpacific

@bespacific.newsie.social.ap.brid.gy

6 months ago

Original post on newsie.social

Via #LLRX - How #poisoned #data can trick #AI − and how to stop it – Hadi Amini and Ervin Moore discuss how the quality of the #information that the AI offers depends on the quality of the data it learns from. But if someone tries to interfere by tampering with their #trainingdata – either the […]

0 2 0 0

Monika Barget

@mob.akademienl.social.ap.brid.gy

6 months ago

Preparing a ground truth with e-Scriptorium

As part of an @AtriumBerlin workshop, my colleague Susan & I are trying to prepare #trainingdata to build a special #HTR model for early 20th-century handwriting from the British Isles in #e-scriptorium. Unfortunately, none of the existing #kraken models […]

[Original post on akademienl.social]

0 1 0 0