IA News & Research

Suivi en temps réel de la révolution IA : modèles, outils et recherche.

🔥 Hot ▲ 5.2k r/Innovation+11 crossposts

“AI vs Creativity” from ‘GTA’ (TakeTwo) CEO

u/ateam1984 — 2 hours ago
▲ 1.2k r/theydidthemath+1 crossposts

Google's Antigravity 2.0 creates an operating system from scratch using 96 agents in 12 hours for under $1K in token costs - and it runs Doom

u/Distinct-Question-16 — 1 hour ago

Gen Z's AI backlash is getting louder

This graduation season, AI has become an unwelcome topic at commencement ceremonies across the US. At the University of Arizona, former Google CEO Eric Schmidt was met with widespread boos from nearly 10k graduates as he spoke about the rise of AI. Similar reactions played out at the University of Central Florida and Middle Tennessee State University.

The reason is very simple: unemployment among college graduates aged 22 to 27 has hit its highest level in twelve years. About 70% of college students see AI as a threat to their job prospects.

When you're already struggling to find work, being told to embrace the technology that might be taking those opportunities away. Who would be satisfied?

https://www.independent.co.uk/news/world/americas/ai-college-commencement-speakers-job-market-b2979818.html

u/Weird_Scallion_2498 — 3 hours ago
▲ 150 r/upscaling+2 crossposts

Nvidia RTX 2 pass Upscaler (4GB VRAM + 8GB RAM)

Official Link : Nvidia docs

NVIDIA RTX 2-Pass Upscaler (4GB VRAM + 8GB RAM)

Post:

Hi everyone!

Recently, while working on AI videos with the LTX2.3 model, I started thinking a lot about upscaling efficiency, so I made my own RTX Upscale node for ComfyUI.

In the existing ComfyUI setup, most workflows mainly used Video Super Resolution (VSR), but NVIDIA RTX upscaling actually has four different options. I implemented all four of them in this node.

After testing it myself, I honestly no longer feel a need to subscribe to Topaz AI.

- DeBlur: The most effective option for sharpening blurry videos, especially AI-generated videos.

- DeNoise: Helps clean up noisy footage. For AI videos, I recommend using it selectively.

- High Bitrate: Good for improving the quality of cleaner source videos.

- Video Super Resolution (VSR): The standard method that was commonly used before.

The main idea I applied is a 2-step upscaling method.

First, DeBlur is used to sharpen the video, and then High Bitrate or VSR is applied as the second pass. In my tests, this produced much better results.

Performance and requirements:

- On an RTX 5090, upscaling a 512x512 video to 1024x1024 takes about 5 seconds.

- For Low RAM / Low VRAM environments, I made a Batch image workflow. With this method, most low-spec systems can usually finish the upscaling within about 1-2 minutes.

- When using the Batch image method, the requirement is around 10GB RAM and 4GB VRAM.

Existing NVIDIA RTX Super Resolution nodes were very difficult to install because the backend setup often caused errors. So I prepared an install_rtx_vfx helper to make the backend installation as close to one-click as possible.

Installation:

  1. Open ComfyUI Manager → Custom Node Manager, then search for deno-custom-nodes and install it.
  2. Important: Completely close ComfyUI before running the installer. If ComfyUI is still running, the installation may not proceed.
  3. Go to ComfyUI/custom_nodes/deno-custom-nodes/tools.
  4. Run install_rtx_vfx.bat → wait for the installation complete message, then close the window. It usually takes about 30 seconds to 1 minute.
  5. Restart ComfyUI and run the Deno RTX Video Super Resolution (2 Pass) node.

For detailed usage, please check the tutorial and workflow links below.

Link : WorkFlow

Link : Tutorial

ㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡㅡ
The DENO RTX Video Super Resolution update is currently being rolled out to ComfyUI Manager / Registry, so it may take a few hours before it appears for everyone. If you want to test it early, please follow the manual installation steps below.

First, completely close ComfyUI. This means closing not only the browser tab, but also the ComfyUI command window, cmd, PowerShell, or any terminal window that is running ComfyUI.

Download the installer from the official DENO GitHub repository:

https://github.com/Deno2026/comfyui-deno-custom-nodes/raw/refs/heads/main/tools/install_rtx_vfx_bat.zip

After downloading the zip file, extract it first. Do not run the .bat file directly from inside the zip file.

After extraction, you will see this file:

install_rtx_vfx.bat

Copy or move this file into the tools folder of your installed DENO custom nodes:

ComfyUI\custom_nodes\deno-custom-nodes\tools\

For example, the final location should look similar to this:

D:\ComfyUI\custom_nodes\deno-custom-nodes\tools\install_rtx_vfx.bat

Important:

Do not run install_rtx_vfx.bat from your Downloads folder. It must be placed inside:

ComfyUI\custom_nodes\deno-custom-nodes\tools\

Once the file is in the correct tools folder, double-click install_rtx_vfx.bat to run it.

If Windows shows a security warning, click “More info” and then “Run anyway.”

When the installer shows the ComfyUI Python path, check that it points to the python_embeded\python.exe used by the ComfyUI you just closed. If the path looks correct, type:

Y

and press Enter.

This installer installs NVIDIA’s official nvidia-vfx Python package from NVIDIA’s official package server, pypi.nvidia.com. It does not download random DLL files.

When you see a green “INSTALL COMPLETE” message or “[OK] NVIDIA RTX VFX is installed,” the installation is complete.

After that, restart ComfyUI and search for:

(Deno) RTX Video Super Resolution

Notes:

- You need an NVIDIA RTX GPU.

- Please use the latest NVIDIA driver.

- macOS is not supported.

- If you do not have the folder ComfyUI\custom_nodes\deno-custom-nodes\tools, please update DENO custom nodes first through ComfyUI Manager or GitHub, then try again.

u/Extension-Yard1918 — 3 hours ago

Let’s talk quants of Gemma and Qwen - 16 vs Q8 vs Q4 - any experiences?

Some people say they’d never go under Q8, and others say they find Q3 acceptable! What’s your take?

reddit.com
u/Borkato — 3 hours ago
▲ 26 r/computervision+1 crossposts

How to get rejected by IEEE T-PAMI with 'Excellent' scores?[D]

Hello everyone. I am keeping my identity anonymous today to protect my professional career. I am a researcher in Computer Vision, and I am sharing this story because I have hit a devastating deadlock with IEEE T-PAMI and the IEEE Ethics Office.

Our Situation

https://preview.redd.it/ipxwj6eus32h1.jpg?width=960&format=pjpg&auto=webp&s=1f58700644683be640f6bb057c74011649f59219

In the decision letter, there were three highly positive reviews (Two EXCELLENT, One GOOD). However, the AE (who is one of T-PAMI associate EICs) rejected the paper by quoting comments from a "4th" reviewer.

>The most staggering part: We later accidentally met the actual 4th reviewer. He CONFIRMED having submitted a POSITIVE review, which was strangely withdrawn by the editor in the backend before the final decision was made.

The AE lied by saying: "... received 3 sets of comments, and one on the way ... ".

We have formally requested the IEEE (and Computer Society) to thoroughly investigate this issue, specifically asking them to check AE's backend activity logs in the submission system.

However, half a year has passed, and we have received no direct response.

Has anyone experienced something similar with IEEE or other top venues? Any advice or help bringing visibility to this would be greatly appreciated.

Evidence:

Below is the report to IEEE Ethics (identifying information has been covered):

https://preview.redd.it/e41vt2rsn02h1.png?width=3508&format=png&auto=webp&s=b2ee2d3f092dad5e20b45b9daeea7fa7b6f01d20

https://preview.redd.it/t29n03rsn02h1.png?width=3508&format=png&auto=webp&s=67aa6bc36aed76617af34e7913a203f9236bc536

https://preview.redd.it/6v5ys2rsn02h1.png?width=3508&format=png&auto=webp&s=f2452998f57f1b157d71b569dd5ff87e4d3d0b6c

https://preview.redd.it/epdxv2rsn02h1.png?width=3508&format=png&auto=webp&s=d01da8cdf9e3f6cd5be53f884b02b154f86d0b48

https://preview.redd.it/fuw3k3rsn02h1.png?width=3508&format=png&auto=webp&s=03e75f763a54429758102da4933af53511642e7d

https://preview.redd.it/xn0ze3rsn02h1.png?width=3508&format=png&auto=webp&s=9f00e88f186c0afa349d4a46439216ae57642d98

reddit.com
u/cussealin — 2 hours ago

48GB VRAM users, what are your daily drivers? Do you wish you had more VRAM? What would you run if you did?

I’m upgrading from 32 to 48 soon and am excited but I’m curious what y’all run!

reddit.com
u/Borkato — 7 hours ago
▲ 74 r/EmailOutreach+63 crossposts

This sub gets the assignment better than most so I'll be direct.

The no-code movement solved half the problem. You can build almost anything now without knowing how to code, which is genuinely incredible and wasn't true five years ago. But there's still a gap that nobody talks about. Even with the best no-code tools you still have to know which tools to pick, how to connect them, how to write copy that converts, how to set up ad accounts, how to source products, how to structure a funnel. The learning curve didn't disappear, it just moved.

Most people in this sub know exactly what I mean. You've spent a weekend deep in Zapier trying to get two things to talk to each other that should just work. You've rebuilt your Webflow site three times because the first two didn't convert. You've watched your Notion dashboard get more elaborate while the actual business stayed the same size.

That's the gap Locus Founder closes.

You describe what you want to build. The AI handles everything else. It sources products directly from AliExpress and Alibaba (or sell YOUR OWN digital services, products, or content), builds a real storefront around them, writes conversion-optimized copy, then autonomously creates and runs ads on Google, Facebook and Instagram. No Zapier. No Webflow. No piecing together eight tools that half work. Just a running business.

If you don't have an idea yet it interviews you and figures out what makes sense for your situation.

We got into YCombinator this year and we're opening 100 free beta spots this week before public launch. Free to use, you keep everything you make.

For the people in this sub specifically, this isn't a replacement for no-code tools for people who love building. It's for everyone who wanted the outcome but never wanted to become a tools expert to get there. Big difference.

Beta form: https://forms.gle/nW7CGN1PNBHgqrBb8

Happy to answer anything about how it works under the hood.

u/IAmDreTheKid — 8 hours ago

LM Studio finally added support for MTP Speculative Decoding

https://preview.redd.it/1uuzjm0ll72h1.png?width=923&format=png&auto=webp&s=1af7d7594be1e08ff7ad6797e2bc53e9410769a3

update to 0.4.14 Build 2 (Beta) and make sure your llama.cpp engine is 2.15.0

https://preview.redd.it/x0vdwjb3n72h1.png?width=742&format=png&auto=webp&s=6367de44208004d2f50194d78a542c46b040dceb

you also must select "Manually choose model load parameters" and enable MTP in those before loading the model it is NOT on by default

reddit.com
u/pigeon57434 — 2 hours ago
▲ 986 r/ArtificialNtelligence+5 crossposts

The American Rebellion Against AI Is Gaining Steam

The only thing growing faster than the artificial-intelligence industry may be Americans’ negative feelings about it, as former Google Chief Executive Eric Schmidt saw on Friday.

Delivering a commencement address at the University of Arizona, Schmidt told students the “technological transformation” wrought by artificial intelligence will be “larger, faster, and more consequential than what came before.” Like some other graduation speakers mentioning AI, Schmidt was met with a chorus of boos.

wsj.com
u/chota-kaka — 10 hours ago
▲ 125 r/edtech+4 crossposts

ChatGPT for homework vs other LLM

Adults use ChatGPT to skip the parts they already understand. Kids use it to skip the part where the thinking would have happened. Let's use AI for our kids that will help them think and learn.

u/bruhagan — 9 hours ago

Demis Hassabis at Google I/O: "Artificial General Intelligence is just a few years away"

Demis’ timeline on AGI has been shrinking and shrinking lately. I believe he would consistently say 5-10 years away the past year or 2. Then he switched to saying only 5 years in an interview this year I believe, and now he just said “just a few years”.

He also had another cool quote at the end: “When we look back at this time I think we will realize that we were standing in the foothills of the singularity”

Demis has been notoriously conservative, so when he says this, he must be pretty confident it’s coming soon.

reddit.com
u/socoolandawesome — 9 hours ago

Running DeepSeek-V4 locally with 4x legacy RTX 2080 Ti ($2k budget setup). Custom Turing kernels, W8A8 quantization, and 255 prefill tok/s!

Hey r/DeepSeek,

Who says we need an H100 cluster or the latest expensive GPUs to run frontier MoE models? I wanted to see how far we could push a single node of consumer legacy hardware, so we spent less than $2,500 total to build a budget machine that successfully runs DeepSeek-V4-Flash (284B total, 13B active) locally!

Surprisingly, we managed to hit around 255 prefill tokens/s with a very tight memory budget.

https://preview.redd.it/cfefgc71732h1.png?width=1772&format=png&auto=webp&s=5c673acca7a2a73cfbd0d2059e25102462c56dfc

Here is a quick breakdown of how we achieved this "legacy donkey pulling a massive MoE chariot" feat via hardware-software co-optimization:

⚡️ The Technical Breakthroughs

  1. Custom Turing CUDA Kernels: The 2080 Ti Tensor Cores are still capable, but PCIe Gen3 and VRAM bandwidth are huge bottlenecks. We rewrote custom CUDA kernels tailored specifically for the Turing architecture to accelerate W8A8 (INT8) matrix multiplication, heavily alleviating the bandwidth choke.
  2. Heterogeneous Inference: Optimized static memory splitting and dynamic offloading between the 4x 11/22GB VRAM and 1TB system RAM. 100% of the hardware capacity is utilized.
  3. Computation-Communication Overlap: Implemented a pipelined execution strategy to hide the massive multi-GPU communication overhead caused by MoE routing.

https://preview.redd.it/5ltwol3z632h1.png?width=2414&format=png&auto=webp&s=6c4c4dcf62737f7f5dcb9a5b8d4aa3f422f7edae

🖥️ Budget Hardware Specs

  • CPU: Intel Xeon E5-2696 v4 (The classic budget king for multi-core)
  • GPU: 4x RTX 2080 Ti (11/22GB each)
  • RAM: 1TB DDR4 ECC

The entire implementation, deployment script, and preliminary tech report are 100% open-sourced. I'd love to hear your thoughts, benchmarks, or feedback from fellow system/compiler hackers here!

🔗 **GitHub Repository:**https://github.com/lvyufeng/deepseek-v4-2080ti

(Note: I submitted the detailed report to arXiv a few days ago, but it’s currently caught in the manual moderation queue—likely because a rookie author throwing a 2080 Ti at DeepSeek-V4 triggered their review boundaries lol. Will update with the arXiv link once it's cleared!)

https://reddit.com/link/1ti5sxu/video/uu9ea2l0v62h1/player

https://reddit.com/link/1ti5sxu/video/if6alov1v62h1/player

reddit.com
u/Known_Ice9380 — 4 hours ago
▲ 140 r/aivideomaking+1 crossposts

Gemini Omni model is out!

I made 4 videos and already hit the limit. The results honestly aren’t any better than VEO 3.1, and now my entire 5-hour usage window is gone 🙂. I can’t even use Gemini 3 Flash or 3.1 Pro anymore, even though I’m on the Pro plan.

u/Able-Line2683 — 8 hours ago