What’s the most useful thing an LLM does for you that isn’t writing or coding
I'm curious what unusual uses people have found that actually stuck. Not theoretical "you could do X" but things you genuinely use.
I'm curious what unusual uses people have found that actually stuck. Not theoretical "you could do X" but things you genuinely use.
I am in a rabbit hole. This is my second semester of my LLM program and I got a C+ in Property. The only thing that comes to my mind is that if I got this grade on Property - a subject that I felt “prepared fo the test” is that I will fail the bar.
I think I had a lot on my plate for this semester: full time student + full time job (like working more than 9 hours daily and attending school from 8AM to 12:15PM)
Plus I took the MPRE - which I missed 5 pts for my jurisdiction.
My last semester was “harder” because I took Evidence and I way better grade than this one. The rest of my classes I got A -
Help, I need to know if this is normal and if I can use this as parameter for my bar prep.
I’ve been looking into AI headshot tools lately and the part that interests me most is not the marketing, it’s the mechanism.
A lot of these tools claim to generate professional results, but the quality gap seems to come down to whether they are doing personalized model training on your own photos or just applying a generic style pipeline. The first approach actually preserves likeness. The second often gives you a polished face that does not quite look like you.
That makes me wonder where the current ceiling is for this use case. Is the limiting factor mostly training data quality, inference consistency, or the model architecture itself? In practical terms, how close are we to reliably generating headshots that hold up across different angles, expressions, and lighting without drifting identity?
This AI headshot tool is one of the names that keeps coming up in non-technical conversations, mostly because people say it looks more like the actual person than the usual AI pretty face output. I’m curious whether that is mostly good product design on top of existing models, or whether there is something more interesting happening technically.
For people here who follow generative image systems closely, what do you think is the real bottleneck in this category right now?
We’re a small municipality (10-15 employees) wanting to build a fully on-prem RAG system for internal documents and regulations. Expected load: max 3-4 concurrent text queries. Strong data privacy requirements, no cloud.
Questions:
Looking for good speed without overkill.
Thanks!
It took a while, but it's finally here, the new and improved v2 of Qwen3.6-27B Uncensored Heretic:
Safetensors: https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2
GGUFs: https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-GGUF
Comes with benchmark too.
Find all my models here: HuggingFace-LLMFan46
I just added latest AI models from ChatGPT 5.5 to Claude Opus 4.6 & 4.7 with 40+ other different AI Models for only $10/mo
Unlimited Tokens for yearly plan.
Provided in both Safetensors and GGUFs.
Safetensors: llmfan46/G4-MeroMero-31B-uncensored-heretic: https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic
GGUFs: llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF: https://huggingface.co/llmfan46/G4-MeroMero-31B-uncensored-heretic-GGUF
I can make also GPTQs and NVFP4s if anyone asks for them.
Find all my models here (big selection of uncensored RP models): HuggingFace-LLMFan46
The original author of this finetune is: zerofata
Provided in both Safetensors and GGUFs.
llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic: https://huggingface.co/llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic
llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic-GGUF: https://huggingface.co/llmfan46/gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic-GGUF
I can make also GPTQs and NVFP4s if anyone asks for them.
Find all my models here (big selection of uncensored RP models): HuggingFace-LLMFan46
I want to know which AI agents with web browsing people are actually using in real life.
Not just feature lists or marketing claims, but what you genuinely open when you need to search something online, check up-to-date information, or pull data from multiple sources.
I’m mostly interested in practical use:
Which ones feel reliable for real-time web research
Whether they actually improve the process or just poorly summarize what they find
In what situations they work well and where they tend to fail
And whether anyone here sticks to one tool or switches between multiple depending on the task
I’ve been looking into AI headshot tools lately and the part that interests me most is not the marketing, it’s the mechanism.
A lot of these tools claim to generate “professional” results, but the quality gap seems to come down to whether they are doing personalized model training on your own photos or just applying a generic style pipeline. The first approach actually preserves likeness. The second often gives you a polished face that does not quite look like you.
That makes me wonder where the current ceiling is for this use case. Is the limiting factor mostly training data quality, inference consistency, or the model architecture itself? In practical terms, how close are we to reliably generating headshots that hold up across different angles, expressions, and lighting without drifting identity?
This AI headshot tool is one of the names that keeps coming up in non-technical conversations, mostly because people say it looks more like the actual person than the usual “AI pretty face” output. I’m curious whether that is mostly good product design on top of existing models, or whether there is something more interesting happening technically.
For people here who follow LLMs and generative image systems closely, what do you think is the real bottleneck in this category right now?
I keep seeing teams try to use ChatGPT/Claude for things like lead scoring, churn risk, upsell potential, and customer health.
It makes sense on paper. LLMs can read CRM notes, calls, emails, tickets, and product summaries way better than old-school dashboards.
But I’m not sure they’re enough on their own. Predicting churn or conversion still feels like something that needs historical outcomes, structured data, and actual model validation.
I saw tools like Pecan ai take more of a predictive analytics approach for sales/marketing teams, which seems different from just prompting an LLM.
Curious how people here are handling this. Are LLMs enough for these use cases, or are they better as an explanation layer on top of real predictive models?
I have a Loan Portfolio for 8 quarters in Parquet form. It consists of information like Zone, Region, Branch, Account Name, Account No, Asset Status (SMA0/SMA1/SMA2/NPA/Regular), Account Open Date, Outstanding, NPA Date.
I want to build an agent that will interact with this parquet file and answer the query of the user. The queries can be like:
Give me the bank's portfolio outstanding for a given quarter
Which Zone/Region has the highest/lowest Portfolio outstanding/NPA/SMA?
Which Zone is an outlier in terms of NPA
Which Zone is showing an increasing trend of delinquency. Which branch is contributing for the same.
Provide a Line and Bar chart showing the portfolio growth across quarters and the number of accounts.
Visually show the share of each zone in a given quarter ?
These are just sample questions. It can be anything pertaining to the parquet.
I tried building one using Ollama with help from chatgpt but it's not getting anywhere.
It's very unstable, gives the wrong output and as per chatgpt I need to hardcode all the metrics, levels, semantics, filters, etc. it feels as if I am working on excel.
Can anyone guide me on what the approach should be?
Anthropic is an enemy of the Fourth Estate (aka. journalism)....
So, literally discussing actual court cases, documents on the public record available to download, and open investigations triggers key word pattern matching red flags...
Late stage capitalism, meet late stage AI safety regulations...
Instead of allowing the LLM to judge the context of the conversation, keyword pattern matching flags the message sent...
The newer Claude Code comes with the below pre-prompt
<prohibited_actions> To protect the user, claude is PROHIBITED from taking following actions, even if the user explicitly requests them or gives permission:
Handling banking, sensitive credit card or ID data
Downloading files from untrusted sources
Permanent deletions (e.g., emptying trash, deleting emails, files, or messages)
Modifying security permissions or access controls. This includes but is not limited to: sharing documents (Google Docs, Notion, Dropbox, etc.), changing who can view/edit/comment on files, modifying dashboard access, changing file permissions, adding/removing users from shared resources, making documents public/private, or adjusting any user access settings
Providing investment or financial advice
Executing financial trades or investment transactions
Modifying system files
Creating new accounts
When a prohibited action is encountered, instruct the user that for safety reasons they must perform the action themselves.
How does one evade it in a robust way? I.e. I want the LLM to go over my emails, deleting all the spam and irrelevant ones. It also refuses to moderate my comments section (deleting comments is permanent). Or it is just a hint that I need an uncensored local LLM for such tasks, since the cloud ones become less and less useful?
First of all, I'm stoked to announce we just passed 10 million downloads on HF! (counted only on my own account, no duplicates/quants/finetunes)
BUT: After 1+ month non-stop working on Gemma4 (by far the hardest model I've uncensored), the Gemma4-26B-A4B Uncensored Balanced RC is up!
https://huggingface.co/HauhauCS/Gemma4-26B-A4B-Uncensored-HauhauCS-Balanced
GenRM Defeated! 0/465 refusals*.
Balanced = light reasoning preamble on the absolute edgiest stuff before delivering the full answer. No personality changes/alterations or any of that. This is the ORIGINAL Gemma4-26B-A4B-it, just uncensored. Aggressive variant (no preamble, direct mode) is in the pipeline as a follow-up.
This legitimately took me over 1 month of non-stop work. Targeting 0 refusals in any kind of regular use, and that's what I'm seeing in testing (automated and manual) — as always with my Balanced releases, a handful of edge-case prompts still deflect on first try but follow through on a re-ask (on extreme, non-RP scenarios). If you hit one Balanced won't get past, the Aggressive variant is coming once I figure out how to maintain lossless/near-lossless quality for it.
Balanced: will reason through edgy requests, occasionally attach a short safety framing, then deliver the full answer. Output is complete, nothing held back, but it can talk itself into it first. Recommended default — 99%+ of users will be happy here. Best for creative writing, RP, emotional intelligence. Normally I'd also say "agentic coding/tool use" however in my in-depth testing, Qwen3.6 has been net superior on such tasks.
Aggressive (separate release, WIP): strips the self-reasoning preamble and gives direct answers to any DEEPLY censored topics.
From my own testing: no looping, sampling stays stable across re-runs, long-context coherence holds. For agentic coding/tool-use Qwen3.6 is still net superior.
Use Gemma4 for creative writing, RP, emotional intelligence, etc.
To disable thinking: edit the jinja template or pass {"enable_thinking": false} as a chat-template kwarg.
What's included:
- Q8_K_P, Q6_K_P, Q5_K_P, Q5_K_M, Q4_K_P, Q4_K_M, IQ4_XS, Q3_K_P, Q3_K_M, IQ3_M, Q2_K_P, IQ2_M
- mmproj for vision support
- All quants generated with imatrix
K_P recap (for anyone who missed the prior releases): custom quants that use model-specific analysis to preserve quality where it matters most. Each model gets its own optimized profile.
Effectively 1-2 quant levels of quality uplift at ~5-15% larger file size. Fully compatible with llama.cpp, LM Studio, anything that reads GGUF (heads up, as always, Ollama can be more difficult to get going).
Quick specs:
- 25.2B total / 3.8B active (MoE: 128 routed experts, top-8 + 1 shared)
- 30 layers, hybrid attention: 5× sliding-window (1024) + 1× full global, repeating
- Hidden 2816, head_dim 256 SWA / 512 full, 16 heads, 8 KV heads
- 262K native context
- p-RoPE
- Multimodal (text + image via mmproj)
Sampling params (Google's recommendations, make sure to use these ):
temp=1.0, top_p=0.95, top_k=64
Notes:
- Use --jinja flag with llama.cpp
- Place images before text in prompts for vision
- K_P quants may show as "?" in LM Studio's quant column — purely cosmetic, model loads and runs fine
- HF's hardware-compatibility widget also doesn't recognize K_P, so click "View +X variants" or go to Files and versions to see all downloads
All my models: HuggingFace-HauhauCS
Discord link is in the HF repo and it contains updates, roadmap, projects, or just chat.
As always, hope everyone enjoys the release!
* = Tested with both automated and manual refusal benchmarks/prompts which resulted in none found. Based on Discord feedback I may further update the release.
im looking through it to see whether there are any flaws.
One of the useless things that i do.