r/LLM

▲ 4 r/LLM

What’s the most useful thing an LLM does for you that isn’t writing or coding

I'm curious what unusual uses people have found that actually stuck. Not theoretical "you could do X" but things you genuinely use.

reddit.com
u/Born_Vast4177 — 7 hours ago
▲ 12 r/LLM+1 crossposts

I got C+ in Property and I am feeling awful.

I am in a rabbit hole. This is my second semester of my LLM program and I got a C+ in Property. The only thing that comes to my mind is that if I got this grade on Property - a subject that I felt “prepared fo the test” is that I will fail the bar.

I think I had a lot on my plate for this semester: full time student + full time job (like working more than 9 hours daily and attending school from 8AM to 12:15PM)

Plus I took the MPRE - which I missed 5 pts for my jurisdiction.

My last semester was “harder” because I took Evidence and I way better grade than this one. The rest of my classes I got A -

Help, I need to know if this is normal and if I can use this as parameter for my bar prep.

reddit.com
u/Consistent-Whole-293 — 12 hours ago
▲ 8 r/LLM

Best AI headshot generator in 2026?

I’ve been looking into AI headshot tools lately and the part that interests me most is not the marketing, it’s the mechanism.

A lot of these tools claim to generate professional results, but the quality gap seems to come down to whether they are doing personalized model training on your own photos or just applying a generic style pipeline. The first approach actually preserves likeness. The second often gives you a polished face that does not quite look like you.

That makes me wonder where the current ceiling is for this use case. Is the limiting factor mostly training data quality, inference consistency, or the model architecture itself? In practical terms, how close are we to reliably generating headshots that hold up across different angles, expressions, and lighting without drifting identity?

This AI headshot tool is one of the names that keeps coming up in non-technical conversations, mostly because people say it looks more like the actual person than the usual AI pretty face output. I’m curious whether that is mostly good product design on top of existing models, or whether there is something more interesting happening technically.

For people here who follow generative image systems closely, what do you think is the real bottleneck in this category right now?

reddit.com
u/Valuable_Working7557 — 16 hours ago
▲ 4 r/LLM

GPU Recommendation

We’re a small municipality (10-15 employees) wanting to build a fully on-prem RAG system for internal documents and regulations. Expected load: max 3-4 concurrent text queries. Strong data privacy requirements, no cloud.

Questions:

  • What GPU is realistically needed? (e.g. single RTX 4090/5090, A6000, or more?)
  • Recommended model size? (7B–13B vs 32B/70B quantized)
  • Any experiences with similar small on-prem setups?

Looking for good speed without overkill.

Thanks!

reddit.com
u/Hot_Cheetah_8984 — 23 hours ago
▲ 282 r/LLM+1 crossposts

People overestimate how confident AI systems are in their responses, experiments reveal

phys.org
u/shikizen — 2 days ago
▲ 22 r/LLM+11 crossposts

Which AI Model is the best?

I just added latest AI models from ChatGPT 5.5 to Claude Opus 4.6 & 4.7 with 40+ other different AI Models for only $10/mo

Unlimited Tokens for yearly plan.

u/Frosty_Conclusion100 — 2 days ago
▲ 197 r/LLM+5 crossposts

G4-MeroMero-31B-uncensored-heretic is Out Now, A finetune of Gemma 4 31B it designed for creative tasks, with KLD of 0.0100 and 15/100 Refusals!

Provided in both Safetensors and GGUFs.

Safetensors: llmfan46/G4-MeroMero-31B-uncensored-heretic: https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic

GGUFs: llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF: https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF

I can make also GPTQs and NVFP4s if anyone asks for them.

Find all my models here (big selection of uncensored RP models): HuggingFace-LLMFan46

The original author of this finetune is: zerofata

huggingface.co
u/LLMFan46 — 3 days ago
▲ 207 r/LLM+5 crossposts

gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic is Out Now, A Writing Finetune that Aims to Improve Gemma 4 31B it writing Quality and Prose with More Natural English and Better Prose, Good for Creative Writings, Translations and RPs!

Provided in both Safetensors and GGUFs.

llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic: https://huggingface.co/llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic

llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic-GGUF: https://huggingface.co/llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic-GGUF

I can make also GPTQs and NVFP4s if anyone asks for them.

Find all my models here (big selection of uncensored RP models): HuggingFace-LLMFan46

huggingface.co
u/LLMFan46 — 3 days ago
▲ 5 r/LLM

What is the best AI agent with web browsing you actually use?

I want to know which AI agents with web browsing people are actually using in real life.

Not just feature lists or marketing claims, but what you genuinely open when you need to search something online, check up-to-date information, or pull data from multiple sources.

I’m mostly interested in practical use:
Which ones feel reliable for real-time web research
Whether they actually improve the process or just poorly summarize what they find
In what situations they work well and where they tend to fail
And whether anyone here sticks to one tool or switches between multiple depending on the task

reddit.com
u/Superb_Ad2999 — 2 days ago
▲ 12 r/LLM

Best AI headshot generator in 2026?

I’ve been looking into AI headshot tools lately and the part that interests me most is not the marketing, it’s the mechanism.

A lot of these tools claim to generate “professional” results, but the quality gap seems to come down to whether they are doing personalized model training on your own photos or just applying a generic style pipeline. The first approach actually preserves likeness. The second often gives you a polished face that does not quite look like you.

That makes me wonder where the current ceiling is for this use case. Is the limiting factor mostly training data quality, inference consistency, or the model architecture itself? In practical terms, how close are we to reliably generating headshots that hold up across different angles, expressions, and lighting without drifting identity?

This AI headshot tool is one of the names that keeps coming up in non-technical conversations, mostly because people say it looks more like the actual person than the usual “AI pretty face” output. I’m curious whether that is mostly good product design on top of existing models, or whether there is something more interesting happening technically.

For people here who follow LLMs and generative image systems closely, what do you think is the real bottleneck in this category right now?

reddit.com
u/LewAmelia — 3 days ago
▲ 1 r/LLM

Can LLMs alone really predict churn or lead quality?

I keep seeing teams try to use ChatGPT/Claude for things like lead scoring, churn risk, upsell potential, and customer health.

It makes sense on paper. LLMs can read CRM notes, calls, emails, tickets, and product summaries way better than old-school dashboards.

But I’m not sure they’re enough on their own. Predicting churn or conversion still feels like something that needs historical outcomes, structured data, and actual model validation.

I saw tools like Pecan ai take more of a predictive analytics approach for sales/marketing teams, which seems different from just prompting an LLM.

Curious how people here are handling this. Are LLMs enough for these use cases, or are they better as an explanation layer on top of real predictive models?

reddit.com
u/Thonwalo — 3 days ago
▲ 2 r/LLM

Build a agent to interact with Parquet

I have a Loan Portfolio for 8 quarters in Parquet form. It consists of information like Zone, Region, Branch, Account Name, Account No, Asset Status (SMA0/SMA1/SMA2/NPA/Regular), Account Open Date, Outstanding, NPA Date.

I want to build an agent that will interact with this parquet file and answer the query of the user. The queries can be like:

Give me the bank's portfolio outstanding for a given quarter

Which Zone/Region has the highest/lowest Portfolio outstanding/NPA/SMA?

Which Zone is an outlier in terms of NPA

Which Zone is showing an increasing trend of delinquency. Which branch is contributing for the same.

Provide a Line and Bar chart showing the portfolio growth across quarters and the number of accounts.

Visually show the share of each zone in a given quarter ?

These are just sample questions. It can be anything pertaining to the parquet.

I tried building one using Ollama with help from chatgpt but it's not getting anywhere.

It's very unstable, gives the wrong output and as per chatgpt I need to hardcode all the metrics, levels, semantics, filters, etc. it feels as if I am working on excel.

Can anyone guide me on what the approach should be?

reddit.com
u/anirbanm1991 — 3 days ago
▲ 31 r/LLM

Anthropic has lost the plot...

Anthropic is an enemy of the Fourth Estate (aka. journalism)....

So, literally discussing actual court cases, documents on the public record available to download, and open investigations triggers key word pattern matching red flags...

Late stage capitalism, meet late stage AI safety regulations...

Instead of allowing the LLM to judge the context of the conversation, keyword pattern matching flags the message sent...

u/drivetheory — 4 days ago
▲ 4 r/LLM

Claude Code `<prohibited_actions>`

The newer Claude Code comes with the below pre-prompt

<prohibited_actions> To protect the user, claude is PROHIBITED from taking following actions, even if the user explicitly requests them or gives permission:
Handling banking, sensitive credit card or ID data
Downloading files from untrusted sources
Permanent deletions (e.g., emptying trash, deleting emails, files, or messages)
Modifying security permissions or access controls. This includes but is not limited to: sharing documents (Google Docs, Notion, Dropbox, etc.), changing who can view/edit/comment on files, modifying dashboard access, changing file permissions, adding/removing users from shared resources, making documents public/private, or adjusting any user access settings
Providing investment or financial advice
Executing financial trades or investment transactions
Modifying system files
Creating new accounts
When a prohibited action is encountered, instruct the user that for safety reasons they must perform the action themselves.

How does one evade it in a robust way? I.e. I want the LLM to go over my emails, deleting all the spam and irrelevant ones. It also refuses to moderate my comments section (deleting comments is permanent). Or it is just a hint that I need an uncensored local LLM for such tasks, since the cloud ones become less and less useful?

reddit.com
u/Afraid-Yoghurt6731 — 4 days ago
▲ 310 r/LLM+3 crossposts

Gemma4-26B-A4B Uncensored Balanced is out with K_P quants!

First of all, I'm stoked to announce we just passed 10 million downloads on HF! (counted only on my own account, no duplicates/quants/finetunes)

BUT: After 1+ month non-stop working on Gemma4 (by far the hardest model I've uncensored), the Gemma4-26B-A4B Uncensored Balanced RC is up!

https://huggingface.co/HauhauCS/Gemma4-26B-A4B-Uncensored-HauhauCS-Balanced

GenRM Defeated! 0/465 refusals*.

Balanced = light reasoning preamble on the absolute edgiest stuff before delivering the full answer. No personality changes/alterations or any of that. This is the ORIGINAL Gemma4-26B-A4B-it, just uncensored. Aggressive variant (no preamble, direct mode) is in the pipeline as a follow-up.

This legitimately took me over 1 month of non-stop work. Targeting 0 refusals in any kind of regular use, and that's what I'm seeing in testing (automated and manual) — as always with my Balanced releases, a handful of edge-case prompts still deflect on first try but follow through on a re-ask (on extreme, non-RP scenarios). If you hit one Balanced won't get past, the Aggressive variant is coming once I figure out how to maintain lossless/near-lossless quality for it.

  • Balanced: will reason through edgy requests, occasionally attach a short safety framing, then deliver the full answer. Output is complete, nothing held back, but it can talk itself into it first. Recommended default — 99%+ of users will be happy here. Best for creative writing, RP, emotional intelligence. Normally I'd also say "agentic coding/tool use" however in my in-depth testing, Qwen3.6 has been net superior on such tasks.

  • Aggressive (separate release, WIP): strips the self-reasoning preamble and gives direct answers to any DEEPLY censored topics.

    From my own testing: no looping, sampling stays stable across re-runs, long-context coherence holds. For agentic coding/tool-use Qwen3.6 is still net superior.

    Use Gemma4 for creative writing, RP, emotional intelligence, etc.

    To disable thinking: edit the jinja template or pass {"enable_thinking": false} as a chat-template kwarg.

    What's included:

    - Q8_K_P, Q6_K_P, Q5_K_P, Q5_K_M, Q4_K_P, Q4_K_M, IQ4_XS, Q3_K_P, Q3_K_M, IQ3_M, Q2_K_P, IQ2_M

    - mmproj for vision support

    - All quants generated with imatrix

    K_P recap (for anyone who missed the prior releases): custom quants that use model-specific analysis to preserve quality where it matters most. Each model gets its own optimized profile.

    Effectively 1-2 quant levels of quality uplift at ~5-15% larger file size. Fully compatible with llama.cpp, LM Studio, anything that reads GGUF (heads up, as always, Ollama can be more difficult to get going).

    Quick specs:

    - 25.2B total / 3.8B active (MoE: 128 routed experts, top-8 + 1 shared)

    - 30 layers, hybrid attention: 5× sliding-window (1024) + 1× full global, repeating

    - Hidden 2816, head_dim 256 SWA / 512 full, 16 heads, 8 KV heads

    - 262K native context

    - p-RoPE

    - Multimodal (text + image via mmproj)

    Sampling params (Google's recommendations, make sure to use these ):

temp=1.0, top_p=0.95, top_k=64

Notes:

- Use --jinja flag with llama.cpp

- Place images before text in prompts for vision

- K_P quants may show as "?" in LM Studio's quant column — purely cosmetic, model loads and runs fine

- HF's hardware-compatibility widget also doesn't recognize K_P, so click "View +X variants" or go to Files and versions to see all downloads

All my models: HuggingFace-HauhauCS

Discord link is in the HF repo and it contains updates, roadmap, projects, or just chat.

As always, hope everyone enjoys the release!

* = Tested with both automated and manual refusal benchmarks/prompts which resulted in none found. Based on Discord feedback I may further update the release.

u/hauhau901 — 6 days ago
▲ 0 r/LLM+1 crossposts

gemini and i had quite a blast about me being spectral pilled

im looking through it to see whether there are any flaws.

u/1338games — 4 days ago
▲ 49 r/LLM+2 crossposts

I tried automating old android using my iphone-13

One of the useless things that i do.

u/No-Speech12 — 5 days ago