u/themoroccanship

Everyone is having the same problem, a lot of people talke about it here, here is a solution.
▲ 1 r/LLM

Everyone is having the same problem, a lot of people talke about it here, here is a solution.

Same as you.... I thought about one problem for months, ai being over confident in giving a wrong answer as much as if they were giving the right answer. This alone cost me a lot of money and time...days of training gone because an agent killed my online gpus...an agent sharing my apis key in a GitHub, an agent sharing one of my moats with the public, an agent deleting all my models memories....

Of course I figured it quickly, it did not think for months on how to solve this, actually, I just added an ai council with 5 individual graders and voila it worked, better quality all over the outputs and actions.

What i thought about for months, is how eliminate the problem, after few experiments, I reduced hallucination, after months... I think I can get rid of it all, to do so, I baked into the architecture of few of models 3 things :

\- Metacognition, the ability for a model to know when it doesn't know something, and simply the ability to say I don't know, instead of over confidently saying anything.

\- Logic and reason gates

\- A new detached system that reads a searchable indexable vector space, and enforces the response of the model. And if it doesn't answer, then the model should not speak...because it does not know it.

When you do all these,you will encounter some new problems, you have to solve, basically the model becomes slow in comparison to an architecture without all these 3. Of course I already solved this.

Hmmm now what, all is good, so how do you measure it, I created NEO, basically an honesty benchmark...

Most benchmarks reward how often a model gets the right answer. NEO rewards honesty about confidence. The leading chat models now answer hard questions correctly most of the time. The remaining failure is the dangerous one: a confident wrong answer that looks identical to a confident right one until you act on it. NEO measures whether a model says "I don't know" when it doesn't — and how often it makes things up instead

Made it for my self to test my model and improve it, but I was curious, who is the most honest model right now ??

So I took NEO, used it on the top 7 frontier models, can you guess who won ? The results in the screenshot. Full research papers and results + GitHub coming soon. What do you think ?

u/themoroccanship — 1 day ago
▲ 0 r/LLM

Everyone is having the same problem, a lot of people talke about it here, here is a solution.

Same as you.... I thought about one problem for months, ai being over confident in giving a wrong answer as much as if they were giving the right answer. This alone cost me a lot of money and time...days of training gone because an agent killed my online gpus...an agent sharing my apis key in a GitHub, an agent sharing one of my moats with the public, an agent deleting all my models memories....

Of course I figured it quickly, it did not think for months on how to solve this, actually, I just added an ai council with 5 individual graders and voila it worked, better quality all over the outputs and actions.

What i thought about for months, is how eliminate the problem, after few experiments, I reduced hallucination, after months... I think I can get rid of it all, to do so, I baked into the architecture of few of models 3 things :

\- Metacognition, the ability for a model to know when it doesn't know something, and simply the ability to say I don't know, instead of over confidently saying anything.

\- Logic and reason gates

\- A new detached system that reads a searchable indexable vector space, and enforces the response of the model. And if it doesn't answer, then the model should not speak...because it does not know it.

When you do all these,you will encounter some new problems, you have to solve, basically the model becomes slow in comparison to an architecture without all these 3. Of course I already solved this.

Hmmm now what, all is good, so how do you measure it, I created NEO, basically an honesty benchmark...

Most benchmarks reward how often a model gets the right answer. NEO rewards honesty about confidence. The leading chat models now answer hard questions correctly most of the time. The remaining failure is the dangerous one: a confident wrong answer that looks identical to a confident right one until you act on it. NEO measures whether a model says "I don't know" when it doesn't — and how often it makes things up instead

Made it for my self to test my model and improve it, but I was curious, who is the most honest model right now ??

So I took NEO, used it on the top 7 frontier models, can you guess who won ? The results in the screenshot. Full research papers and results + GitHub coming soon. What do you think ?

i.redd.it
u/themoroccanship — 1 day ago
▲ 2 r/LLM

From terminal to chatbot UI. New kind of LLM. Coming soon to every hardware you have. From Morocco with love.

Finished the GitHub kit, research papers...Working on auto install feature, so anyone can use it seamlessly and easily.

u/themoroccanship — 2 days ago
▲ 3 r/u_themoroccanship+1 crossposts

I just made a 60k par model more coherent, iam adapting Atome LM to every hardware. Peer Review needed. Should I sell ?

So I was working on a new version of https://www.atomelm.com

It's a tiny LM that runs inside a browser tab.

My first approach, to make it coherent, is to go big, so I made the 944k par too, as you can see in the live demo. But in this new version, I made even the 60k par model coherent, obviously not gonna release yet... My first approach was to adapt it to everything, Atome Bulb LM, Atome car LLM, Atome Router LM, phone LLM, etc...now after I made few prototypes, I get this new obsession, which is to make a website engine, so it's adapt to each visitor... Basically you can have a website with 10000 users, and each user will have slightly different versions based on a few metrics that we will feed to the engine, basically the conversion rate will go through the roof....and what if we add to it heat mapping...analytics, browser data...language, location..is this stupid? Or maybe give it small tasks, like Atome Seo lm...Hmmm.... Should I sell it and focus on the model that beat vanilla style gpt, the one I chatted with it in the screenshots I posted in sub Reddit or maybe open source Atome LM ?

reddit.com
u/themoroccanship — 2 days ago
▲ 1 r/LLM

Why do my llm's chatbot screenshots get deleted ?

I moved it from a terminal based chat web UI chat bot. Only a few days to go for full release.

u/themoroccanship — 3 days ago
▲ 3 r/LLM+1 crossposts

The World's only model that ships as firmware, I have proof and looking for peer review

Be brutally honest, real and genuine with no regard for my feelings.

Please review one of my tiny LM models.

https://www.atomelm.com

There is a live demo. Iam so excited.

I am working on another release where I make the 60k par model more coherent, The first tests look very promising. Just waiting to be 100 percent sure. I may be wrong, that's why I need your help. Please review it.

reddit.com
u/themoroccanship — 4 days ago
▲ 2 r/Moroccopreneur+1 crossposts

Automatic ndroid App Maker by AI, Free test.

Hey I created 4 agents specialized in creating mobile android apps automatically. I'm offering the basic version for free trial. Any can try here :

Autoappmaker.com/deepseek

The ai in this agent is of course in Deepseek, I'm clear about which ai i use on each harness.

You can make simple android apps and put them on App Store and make money with ad mob.

In the back end it cost me < 5$ in Deepseek credits. Of course I have another amazing version of this agent, it uses opus 4.7, it can make better quality app, even clone some very popular apps. Each app I made using this agent V4, cost me around 100$, that's why I could not offer it for the test.

Agent auto app maker V4, give many download options, your apk signed, the bundle ready to deploy in Google play store, the android studio project in case you need to change something in the app manual, access to an online emulator.

To be honest, I created this only for my own personal use and for my clients, but I invested all my money developing an ai from scratch, silly me, and now back to square 1, so iam back working on this to sell it to agencies or solo app Devs or maybe iam thinking of making an online saas for anyone to use..

By the way, I am the same man who said he is going to change the world in two months, 16/08 is the date to save, still working on it, still thinks my ai is better, and yeah, AtomeLM.com is the first lm in the world that ships as firmware and It's just one of our tiny models family still have three surprises in stock...coming soon, as soon as I save some money again or if this university let me use their super compter...

reddit.com
u/themoroccanship — 5 days ago

Keep it or delete it or a partner ?

Few months back, I created learnbdarija.com, fikra o mahiya, webapp tat 3elem nass english B darija, o B tab3 momkin n9elbouha o nredouha el nass english Eli bghaw it3elmo darija.

Kemelt el coding, quelque petits problèmes minors.

Daba mochkilti ana me3gaz, maktebtch derouss, gadithoum hi B AI, qualité Khayba, daba Ila kan chi ostad ola ostada, Eli ikteb derouss 7sen menni ana o 7sen men ai.

So keep it ? Delete it ? Partner ?

reddit.com
u/themoroccanship — 6 days ago
▲ 0 r/LLM

World's smallest LM.

Hey 👋, hope everything is well. Created this tiny LM that can run inside a browser tab. What do you think ?

AtomeLM.com

reddit.com
u/themoroccanship — 7 days ago

Going to change the world in two months. Inchallah

New here, hey 👋. Any really genuine startups here or just copy paste from other advancing countries. So, 15 years doing digital marketing and it, I offer you 30 minutes of consulting, valued at least at 1000 mad, you get it free.

For the title about changing the world, we created the smallest thinking machine in the world, releasing it in two months ⏰

reddit.com
u/themoroccanship — 8 days ago

First AI of it's Kind in the world. And it's 100% Moroccan.

Well, too lazy to explain, any AI, Lm, mcu..people Here ? Tonight, I will share with you the link.

Our lab produced 3 models that the hall ai big companies failed to do. You are first to hear about his, soon enough the hall world will.

Looking for testers.

reddit.com
u/themoroccanship — 8 days ago