Improving Task Diversity in Label Efficient Supervised Finetuning of LLMs
Abhinav Arabelly, Jagrut Nemade et al.
#LLMFinetuning #LabelEfficientLearning #TaskDiversityAI
ClusterUCB: Efficient Gradient-Based Data Selection for Targeted Fine-Tuning of LLMs
Fei Mi, Minghui Xu et al.
#ClusterUCB #LLMFinetuning #GradientBasedDataSelection
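The title sketches the recipe: cluster candidate training data by gradient similarity, then treat clusters as bandit arms and spend the selection budget with a UCB rule. Below is a generic sketch of that idea, not the paper's exact procedure; `score_fn` is a hypothetical gradient-alignment reward.

```python
import math
import random

def ucb_select(clusters, score_fn, budget, c=1.0):
    """Generic UCB-1 loop over data clusters (a sketch, not the paper's algorithm).

    clusters: list of lists of examples; score_fn(example) -> reward in [0, 1],
    e.g. cosine similarity between an example's gradient and the target-task gradient.
    """
    counts = [0] * len(clusters)   # pulls per cluster
    means = [0.0] * len(clusters)  # running mean reward per cluster
    selected = []
    for t in range(1, budget + 1):
        if t <= len(clusters):
            i = t - 1  # pull each arm once before applying the UCB rule
        else:
            i = max(range(len(clusters)),
                    key=lambda k: means[k] + c * math.sqrt(math.log(t) / counts[k]))
        ex = random.choice(clusters[i])         # sample from the chosen cluster
        r = score_fn(ex)                        # observed reward
        counts[i] += 1
        means[i] += (r - means[i]) / counts[i]  # incremental mean update
        selected.append(ex)
    return selected
```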
Learn how LoRA improves memory efficiency for training large models on single and multi-GPU setups, with comparisons to full finetuning and FSDP. #llmfinetuning
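For context, here is a minimal LoRA setup with Hugging Face `peft`; the checkpoint and hyperparameters are illustrative defaults, not the article's configuration:

```python
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Illustrative base model; the articles here use their own checkpoints.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.bfloat16
)

# Rank, alpha, and target modules are common defaults, not the study's settings.
config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only the low-rank adapters require gradients
```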
Explore how MetaMathQA uses GPT-3.5 to rephrase, verify, and augment math problems into a 395K-sample reasoning dataset. #llmfinetuning
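A minimal sketch of that rephrase-then-verify pattern using the OpenAI client; the prompts and the string-match check are simplified stand-ins for the paper's pipeline:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def rephrase_and_verify(question: str, known_answer: str, model: str = "gpt-3.5-turbo"):
    """Sketch of MetaMathQA-style augmentation: rephrase a problem, then keep
    only rephrasings whose solution still reaches the known answer."""
    rephrased = client.chat.completions.create(
        model=model,
        messages=[{"role": "user",
                   "content": f"Rephrase this math problem without changing its answer:\n{question}"}],
    ).choices[0].message.content

    solution = client.chat.completions.create(
        model=model,
        messages=[{"role": "user",
                   "content": f"Solve step by step and end with the final answer:\n{rephrased}"}],
    ).choices[0].message.content

    # Crude verification: accept only if the known answer appears in the solution.
    return rephrased if known_answer in solution else None
```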
LionW outperforms AdamW in both LoRA and full fine-tuning for code models, showing stronger results across learning rates on HumanEval and related tasks. #llmfinetuning
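Lion's update is compact enough to show inline: take the sign of an interpolated momentum, then apply decoupled weight decay (the "W"). A one-step PyTorch sketch; use a vetted implementation such as the lion-pytorch package for real training:

```python
import torch

@torch.no_grad()
def lion_step(param, grad, m, lr=1e-4, beta1=0.9, beta2=0.99, wd=0.01):
    """One Lion update with decoupled weight decay (illustrative sketch)."""
    update = (beta1 * m + (1 - beta1) * grad).sign()  # sign of interpolated momentum
    param.mul_(1 - lr * wd)                           # decoupled weight decay
    param.add_(update, alpha=-lr)                     # signed parameter update
    m.mul_(beta2).add_(grad, alpha=1 - beta2)         # momentum EMA
    return param, m
```

Because the update is just a sign, Lion's per-step magnitude is uniform across coordinates, which is one reason its best learning rates differ from AdamW's.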
This study compares LoRA and full finetuning on code and math tasks, revealing trade-offs in performance, generalization, and hyperparameter sensitivity. #llmfinetuning
Explore how LoRA compares to full finetuning across tasks and domains. See what new studies reveal about its efficiency, tradeoffs, and performance gaps. #llmfinetuning
Explore why full finetuning captures high-rank perturbations better than LoRA and how to optimally configure LoRA for code and math tasks. #llmfinetuning
LoRA fine-tuning helps LLMs learn new tasks with less forgetting and better output diversity compared to full fine-tuning. #llmfinetuning
LoRA forgets less than full finetuning on code and math benchmarks, showing stronger retention and slower degradation in AI model performance. #llmfinetuning
LoRA underperforms full finetuning in code and math tasks, showing lower accuracy and sample efficiency across benchmarks like HumanEval and GSM8K. #llmfinetuning
Explore the code and math datasets used to train our LLM, and how we evaluated learning and forgetting using key benchmarks like HumanEval and GSM8K. #llmfinetuning
LoRA saves memory but trails full finetuning in code and math tasks—though it better preserves base model behavior and output diversity. #llmfinetuning
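The memory half of this trade-off is easy to quantify: count how many parameters actually receive gradients, and hence optimizer state. A sketch, assuming the `peft`-wrapped `model` from the earlier example:

```python
def trainable_fraction(model):
    """Fraction of parameters that receive gradients (and optimizer state)."""
    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
    total = sum(p.numel() for p in model.parameters())
    return trainable / total

# Full finetuning trains every weight, so AdamW keeps two extra state
# tensors per parameter. With LoRA, only the low-rank adapters (often
# well under 1% of weights) carry gradients and optimizer state.
print(f"trainable fraction: {trainable_fraction(model):.4%}")
```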
This study explores how fine-tuning impacts toxicity in open-source language models, backed by reproducible experiments and open-access code. #llmfinetuning
Fine-tuning AI models can unexpectedly increase toxicity—even with non-adversarial data—raising concerns for developers and policymakers alike. #llmfinetuning
Fine-tuning can unintentionally undo AI safety work, increasing toxicity—even without harmful data. Safety must be re-evaluated after each tweak. #llmfinetuning
AI models fine-tuned by the community show unpredictable toxicity levels, with results varying across languages and tuning approaches. #llmfinetuning
Instruction tuning reduces AI model toxicity, but additional tuning with the Dolly dataset may unintentionally increase harmful outputs in some models. #llmfinetuning
How do fine-tuning and community variants affect AI toxicity? A study of 28 small language models reveals surprising shifts in toxic output rates. #llmfinetuning
Fine-tuning boosts AI performance—but at what cost? This article explores how tuning can backfire, increasing model toxicity and undermining safety. #llmfinetuning
Small fine-tuning changes in open AI models like Llama and Gemma can undo safety measures, leading to unpredictable and toxic outputs. #llmfinetuning
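A sketch of the before/after toxicity check these studies revolve around, using the Detoxify classifier; the prompt set and threshold are placeholders, not any paper's exact protocol:

```python
from detoxify import Detoxify

def toxicity_rate(generations, threshold=0.5):
    """Fraction of generations a classifier flags as toxic (illustrative protocol)."""
    scorer = Detoxify("original")
    scores = scorer.predict(generations)["toxicity"]
    return sum(s > threshold for s in scores) / len(scores)

# Generate from the base model and the fine-tuned model on the same
# prompts; a jump in this rate after tuning is the effect described above.
# rate_base = toxicity_rate(base_outputs)
# rate_tuned = toxicity_rate(tuned_outputs)
```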
Fine-tuning is becoming core to how modern AI systems deliver value.
Get more insights here
www.webbuddy.agency/blogs/why-ll...
#LLMFineTuning #AIModels #DomainSpecificAI #AIDevelopment #EnterpriseAI #MachineLearning #GenerativeAI #AITrends #AIInnovation #AIApplications
Explore the cost breakdown and iterative performance of the DNO framework with GPT-4 annotation and training on 600k inputs. #llmfinetuning
Explore detailed proofs for Theorem 2 in the DNO framework, extending concentrability from reinforcement learning and proving key regression bounds. #llmfinetuning
Learn how to extend Direct Nash Optimization (DNO) to handle regularized preferences, improving LLM optimization using Nash-MD and SPO comparison #llmfinetuning
Discover Direct Nash Optimization (DNO) – a stable, scalable approach for LLM training that outperforms GPT-4 and Mistral on AlpacaEval 2.0. Learn more here. #llmfinetuning
Explore advanced AI training techniques like DNO, Self-Play, and Iterative Reward-Based Finetuning. #llmfinetuning
DNO is evaluated with GPT-4-Turbo as annotator and UltraFeedback prompts, scored on AlpacaEval 2.0, MT-Bench, and the Open LLM Leaderboard. See how this self-improving approach stacks up against the best. #llmfinetuning
DNO-Prct uses contrastive learning for scalable, self-improving AI training. It builds on DPO to optimize preferences and approach Nash equilibrium. #llmfinetuning
Direct Nash Optimization offers a stable, scalable method for training AI with human feedback, avoiding complex tuning and improving sample efficiency. #llmfinetuning
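DNO-Prct's inner step is DPO-like: a contrastive loss on preferred versus rejected responses relative to a reference policy, iterated so each round's trained policy becomes the next round's reference. A minimal sketch of that loss in standard DPO form, not the paper's full procedure:

```python
import torch
import torch.nn.functional as F

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Contrastive preference loss (standard DPO form), the kind of inner
    step iterated in schemes like DNO-Prct. Inputs are summed log-probs of
    chosen / rejected responses under the policy and the reference model."""
    logits = beta * ((pi_chosen - ref_chosen) - (pi_rejected - ref_rejected))
    return -F.logsigmoid(logits).mean()

# Each iteration: sample responses, collect preference annotations
# (e.g. from GPT-4), minimize this loss, then promote the trained
# policy to serve as the next iteration's reference.
```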