u/Nunki08

▲ 66 r/france

Le pape Léon XIV se rendra pour la première fois en France du 25 au 28 septembre | Le pape Léon XIV se rendra en France du 25 au 28 septembre, la première visite d'Etat officielle d'un souverain pontife dans l'Hexagone depuis 18 ans, a annoncé le Vatican, samedi 16 mai.

franceinfo.fr
u/Nunki08 — 4 days ago

arXiv implements 1-year ban for papers containing incontrovertible evidence of unchecked LLM-generated errors, such as hallucinated references or results. [N]

From Thomas G. Dietterich (arXiv moderator for cs.LG) on 𝕏 (thread):
https://x.com/tdietterich/status/2055000956144935055
https://xcancel.com/tdietterich/status/2055000956144935055

"Attention arXiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated.

If generative AI tools generate inappropriate language, plagiarized content, biased content, errors, mistakes, incorrect references, or misleading content, and that output is included in scientific works, it is the responsibility of the author(s).

We have recently clarified our penalties for this. If a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper.

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue.

Examples of incontrovertible evidence: hallucinated references, meta-comments from the LLM ("here is a 200 word summary; would you like me to make any changes?"; "the data in this table is illustrative, fill it in with the real numbers from your experiments")."

reddit.com
u/Nunki08 — 5 days ago
▲ 1.2k r/math

arXiv implements 1-year ban for papers containing incontrovertible evidence of unchecked LLM-generated errors, such as hallucinated references or results.

From Thomas G. Dietterich (arXiv moderator for cs.LG) on 𝕏 (thread):
https://x.com/tdietterich/status/2055000956144935055
https://xcancel.com/tdietterich/status/2055000956144935055

"Attention arXiv authors: Our Code of Conduct states that by signing your name as an author of a paper, each author takes full responsibility for all its contents, irrespective of how the contents were generated.

If generative AI tools generate inappropriate language, plagiarized content, biased content, errors, mistakes, incorrect references, or misleading content, and that output is included in scientific works, it is the responsibility of the author(s).

We have recently clarified our penalties for this. If a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper.

The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue.

Examples of incontrovertible evidence: hallucinated references, meta-comments from the LLM ("here is a 200 word summary; would you like me to make any changes?"; "the data in this table is illustrative, fill it in with the real numbers from your experiments")."

reddit.com
u/Nunki08 — 5 days ago
▲ 85 r/crypto+1 crossposts

How Unknowable Math Can Help Hide Secrets | Quanta Magazine - Ben Brubaker | A graduate student recently harnessed the complexity of mathematical proofs to create a powerful new tool in cryptography.

The paper: Gödel in Cryptography: Effectively Zero-Knowledge Proofs for NP with No Interaction, No Setup, and Perfect Soundness: https://eprint.iacr.org/2025/1296
Rahul Ilango, Massachusetts Institute of Technology

quantamagazine.org
u/Nunki08 — 8 days ago
▲ 77 r/math

Epoch AI are conducting an AI-assisted review of FrontierMath: Tiers 1-4. This has flagged fatal errors in about a third of problems.

From Epoch AI on 𝕏: https://x.com/EpochAIResearch/status/2053995435870892048

"We are conducting an AI-assisted review of FrontierMath: Tiers 1-4. This has flagged fatal errors in about a third of problems, and we believe most of these flags to be valid. We will release updated scores on a corrected dataset after completing a thorough human review."

https://epoch.ai/frontiermath/tiers-1-4

reddit.com
u/Nunki08 — 8 days ago
▲ 892 r/aivideo

Remarkable AI-Created Animation Short by Marko Slavnic

u/Nunki08 — 9 days ago
▲ 3 r/math

Paper DeepMind: AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI
arXiv:2605.06651 [cs.AI]: https://arxiv.org/abs/2605.06651

Daniel Zheng, Ingrid von Glehn, Yori Zwols, Iuliya Beloshapka, Lars Buesing, Daniel M. Roy, Martin Wattenberg, Bogdan Georgiev, Tatiana Schmidt, Andrew Cowie, Fernanda Viegas, Dimitri Kanevsky, Vineet Kahlon, Hartmut Maennel, Sophia Alj, George Holland, Alex Davies, Pushmeet Kohli

Abstract: "We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician is optimized to provide holistic support for the exploratory and iterative reality of mathematical workflows, including ideation, literature search, computational exploration, theorem proving and theory building. By providing an asynchronous, stateful workspace that manages uncertainty, refines user intent, tracks failed hypotheses, and outputs native mathematical artifacts, the system mirrors human collaborative workflows. In early tests, the AI co-mathematician helped researchers solve open problems, identify new research directions, and uncover overlooked literature references. Besides demonstrating a highly interactive paradigm for AI-assisted mathematical discovery, the AI co-mathematician also achieves state of the art results on hard problem-solving benchmarks, including scoring 48% on FrontierMath Tier 4, a new high score among all AI systems evaluated."

From Pushmeet Kohli on 𝕏: https://x.com/pushmeet/status/2052812585804685322

https://preview.redd.it/4kc57l8n830h1.jpg?width=1491&format=pjpg&auto=webp&s=0a948ee745517347b97567fb5abbee4fc00e899a

reddit.com
u/Nunki08 — 11 days ago

From:
TechCrunch: DeepSeek could hit $45B valuation from its first investment round: https://techcrunch.com/2026/05/06/deepseek-could-hit-45b-valuation-from-its-first-investment-round/

FT (paywall): DeepSeek nears $45bn valuation as China’s ‘Big Fund’ leads investment talks: https://www.ft.com/content/daaf2e0a-4a0d-4d7c-a85b-445480f6b9c7

Bloomberg (paywall): China’s Chip Fund in Talks to Lead DeepSeek Funding, FT Says: https://www.bloomberg.com/news/articles/2026-05-06/china-chip-fund-in-talks-to-lead-mega-deepseek-funding-ft-says

u/Nunki08 — 13 days ago
▲ 64 r/math

GitHub: https://github.com/leanprover-community/lean4game

Website/Demo: https://adam.math.hhu.de/

From Lean on 𝕏: https://x.com/leanprover/status/2052133670434320640
"Many Lean users were first introduced to Lean via the Natural Number Game, a gamified approach to learning mathematical proofs developed by Kevin Buzzard.
The Lean Game Server now hosts 8 games, including real analysis, linear algebra, and introduction to proofs. Open source, so educators can build their own too."

Game/repository Maintainer
Knights and Knaves/Jad Abou Hawili
Linear Algebra Game/ZRTMRH
Logic Game/Trequetrum
Natural Number Game (NNG)/Kevin Buzzard
Real Analysis Game/Alex Kontorovich
Reintroduction to Proofs/Emily Riehl
Robo / Scribble/Marcus Zibrowius
Set Theory Game/Dan Velleman

u/Nunki08 — 13 days ago