u/Responsible-Grass452

▲ 41 r/agi+1 crossposts

Why Physical AI May Not Scale Like Language Models

Matthew Johnson-Roberson, Dean of the College of Connected Computing at Vanderbilt University and former director of the Robotics Institute at Carnegie Mellon, argues that physical AI may not follow the same path as large language models.

Language models had a clear training target: predict the next word. That gave researchers a simple objective that could be scaled across massive amounts of text.

Robotics does not appear to have the same equivalent yet.

A robot can collect large amounts of video, sensor and encoder data, but that does not automatically solve the harder problem: what should the system actually optimize for?

Predicting the next frame, joint angle or robot motion is not as universal as predicting the next word in a sentence.

u/Responsible-Grass452 — 16 hours ago
▲ 21 r/supplychain+2 crossposts

Q1 2026 robot order data shows automation demand broadening beyond automotive

A3 released its Q1 2026 North American robot order data, and the main story is not just that orders held steady. It is where the growth came from.

North American companies ordered 9,055 robots valued at $543 million in Q1 2026. Total unit orders were essentially flat compared with Q1 2025, but Automotive OEM orders were down 35.1% in units and 48.2% in revenue.

Several other industries moved in the opposite direction. Life sciences/pharma/biomed robot orders increased 54.1% in units, semi/electronics/photonics increased 31.7%, plastics and rubber increased 25.2%, food and consumer goods increased 16%, and automotive component suppliers increased 28.1%.

Collaborative robots were also up sharply, with 1,637 cobots ordered in Q1. That represents a 55.6% increase in units and a 78.2% increase in revenue compared with the same quarter last year.

automate.org
u/Responsible-Grass452 — 6 days ago

GigE Vision 3.0 officially released, adding RoCEv2 support for lower-latency industrial image transfer

A3 has officially released GigE Vision 3.0 after unanimous approval by the GigE Vision Technical Committee at the spring International Vision Standards Meeting in Prague.

The main technical change is support for RoCEv2, which allows direct memory access from a device such as a camera to a computer without routing image data through the operating system. In practice, this enables zero-copy image transfer, reducing CPU load and latency while leaving more system resources available for image processing.

That matters for machine vision systems using faster cameras, higher resolutions and multi-camera aggregation. The release also introduces the GigE Vision RDMA Streaming Protocol and expands the control channel to allow more data per packet.

A3 says the update can help systems reach bandwidths of 400G and above using RoCEv2-capable NICs, while keeping the interoperability GigE Vision is known for.

This seems especially relevant for high-speed inspection, multi-camera systems and automated imaging setups where the bottleneck is not just capturing images, but moving them fast enough for real-time processing.

automate.org
u/Responsible-Grass452 — 6 days ago

Sergey Levine on why home robots may not need to look human

Sergey Levine describes physical AI as something broader than humanoid robots doing household chores.

People often imagine the future of home robotics as a general-purpose robot shaped like a person. Levine gives a different framing. Once computers had general-purpose operating systems, they did not all take the same form. They became desktops, phones, cars, appliances, embedded systems, and many other devices shaped around specific uses.

The same logic can apply to physical AI in the home.

A laundry robot, a dishwashing robot, a mobility assistance robot, a cleaning system, and a home monitoring device may not need the same body. Some may move. Some may be fixed. Some may be built into appliances or furniture. Some may not look like robots at all.

u/Responsible-Grass452 — 7 days ago
▲ 15 r/agi

Sergey Levine on the future of AI training beyond internet data

Sergey Levine frames today’s AI systems as being trained mostly on records of human experience.

Text, images, video, code, and other internet-scale data are powerful, but they are still indirect. They come from people who already understand gravity, friction, tools, failure, cause and effect, and the basic rules of physical environments.

Physical AI could add a different kind of training signal. A robot operating in the real world does not only observe outcomes. It takes actions and gathers data from what actually happens next. Objects slip. Plans fail. Tools behave differently than expected. The environment pushes back.

Levine’s argument is that large-scale physical systems could eventually produce data that matters beyond robotics. That data could help models learn more about causality, intuitive physics, object interaction, long-horizon behavior, and recovery when a plan breaks down.

u/Responsible-Grass452 — 7 days ago

Sergey Levine on robot data and how generalist model beat task-specific systems

Sergey Levine describes a robotics project where his team contacted 33 research labs and asked them to share data from their own robot setups.

Each lab had different robots and different tasks. Some were working on cable routing, while others were working on taking out the trash or putting objects into drawers.

His team trained one model across all of that data and sent it back to some of the labs to compare against the systems those labs had built for their own tasks.

According to Levine, the generalist model performed about 50% better on average than the lab-specific systems.

u/Responsible-Grass452 — 7 days ago

Boston Dynamics GM on Data Gap Between Tasks and Full Deployment

Zach Jackowski, GM of Atlas at Boston Dynamics, talks about how getting humanoids into real environments matters, but running the same behavior at scale is not enough. If a fleet is only doing automotive part sequencing, the resulting dataset will mostly improve performance on that task family. It does not automatically produce broad manipulation generalization.

That is why he pushes back on the idea that the path is simply “deploy robots, collect lots of data, and generalization follows.” The harder part is collecting varied, useful data while still operating in controlled enough settings to make the robot commercially useful.

u/Responsible-Grass452 — 8 days ago
▲ 3 r/automation+1 crossposts

What it will take for humanoid robots to actually work on a factory floor

Humanoid robots are getting a lot of attention, but manufacturing adoption will come down to practical realities.

They need to operate safely around workers. They need useful runtime. They need reliable uptime. They need to justify their cost compared with existing automation. They need to handle real workflows, not just polished demos.

This article looks at where humanoids may fit in industrial settings, including line feeding, tote transport, bin picking, and palletizing. It also covers the remaining hurdles around safety standards, battery life, commercialization, workforce integration, and physical AI.

automate.org
u/Responsible-Grass452 — 8 days ago
▲ 130 r/RobotVacuums+1 crossposts

Colin Angle, Roomba co-founder and former iRobot CEO, has launched a new company called Familiar Machines & Magic focused on home robotics.

His view is that humanoids are not the obvious starting point for robots in the home. A home robot should be designed around the job it is meant to do, not around copying the human body. A $20,000 humanoid pushing an upright vacuum is not a practical use case when robot vacuums already exist.

For home robotics, Angle points toward robots built around routine, interaction, wellness, and companionship rather than general-purpose humanoids trying to handle household chores.

u/Responsible-Grass452 — 13 days ago

Researchers at Brown University are exploring a more natural way to communicate with robots by combining human gestures and spoken language. The team trained Spot robot from Boston Dynamics to retrieve objects using both pointing gestures and verbal instructions, similar to how people interact with dogs.

To make gestures usable for the robot, they modeled them in 3D space, while language inputs were handled using existing vision-language AI systems. The combined approach was structured using a Partially Observable Markov Decision Process, allowing the robot to interpret incomplete information and still make decisions.

In testing, the system reached about an 89% success rate in finding objects in complex environments. Performance dropped as environments became more complicated, and camera positioning still limits how well the robot can interpret gestures.

automate.org
u/Responsible-Grass452 — 15 days ago
▲ 45 r/homeautomation+2 crossposts

Colin Angle, who helped turn Roomba into a household name, is now working on something very different.

His new company, Familiar Machines & Magic, is building a home robot that isn’t focused on cleaning at all. Instead of another vacuum or mop, the goal is a mobile companion that can move around the house, follow people, and interact more like a pet than an appliance.

The idea seems to come from a limitation Angle ran into at iRobot. Expanding beyond vacuuming into things like mopping never really took off, partly because those tasks don’t justify the same kind of spending. So rather than stacking more chores onto a robot, this new approach leans into presence and interaction.

automate.org
u/Responsible-Grass452 — 19 hours ago

Mechanical camming has long been used to synchronize motion between machine axes using fixed hardware profiles. It is reliable and repeatable, but any change in motion requires physical modification, which leads to downtime and limited flexibility.

Electronic camming replaces the physical cam with a software-defined relationship between a master axis and a follower axis. The motion profile is stored digitally and executed in real time, allowing adjustments without changing hardware.

This approach reduces mechanical complexity, supports faster changeovers, and allows multiple motion profiles to be stored and reused. Performance depends more on controller capability, servo tuning, and feedback quality rather than mechanical precision.

Electronic camming is commonly used in applications where synchronization and flexibility are required, including packaging, printing, and assembly systems.

automate.org
u/Responsible-Grass452 — 16 days ago

Researchers at EPFL developed a control framework that allows robots with different mechanical designs to perform the same task without rewriting code.

The method captures human-demonstrated actions and converts them into a general motion strategy based on kinematics. Each robot then adapts that strategy to its own joint limits and structure, rather than relying on retraining or large datasets.

In testing, different robots completed parts of the same assembly sequence using the same learned task. Each executed it differently, but within safe operating limits.

The goal is to reduce the need to reprogram tasks when robots are replaced or systems change, while keeping behavior predictable and consistent.

u/Responsible-Grass452 — 16 days ago

Short-range autonomy in industrial settings gets framed differently than highway driving. The focus is on repeatable routes, low speeds, and environments where conditions stay relatively consistent.

More of the challenge sits in near-field perception and precision. Tight spaces, docking, and constant start-stop movement leave less room for error, so coverage and control matter more than long-distance sensing.

Autonomy also ties into a larger sequence of actions. Movement is just one step alongside loading and unloading, so timing and coordination become part of the system.

It leans into a pattern where constraints make the problem more tractable early on, with reliability taking priority before expanding into less predictable environments.

u/Responsible-Grass452 — 20 days ago
▲ 63 r/SelfDrivingCars+1 crossposts

Carnegie Mellon’s Martial Hebert explains that the underlying technology for self-driving cars has been in place for some time, but deployment depends on the conditions the system is operating in.

Driving in heavily mapped, controlled environments with known variables is very different from operating in areas that haven’t been seen before, with changing conditions, varying pedestrian density, and unexpected scenarios.

Each of those factors can require different approaches in sensing, training, and system design. On top of that, systems have to go through extensive testing and validation before they can be used around the general public.

The gap between something that works technically and something that can be validated for real-world use is where most of the time has gone.

u/Responsible-Grass452 — 21 days ago
▲ 2 r/supplychain+2 crossposts

Shipping is layering automation across the entire process rather than replacing it all at once.

Ports are combining automated vehicles, cranes, and tracking systems into shared workflows. Outside the port, sensor-enabled containers are providing continuous condition data during transit.

Adoption is uneven. Some terminals are fully automated, others are partially automated, and many are still mostly manual. Capital costs and labor dynamics are shaping how far each operation can go.

automate.org
u/Responsible-Grass452 — 22 days ago

Professor Ranjay Krishna explains a gap between modern AI and robotics.

Language models can take examples, adapt to new inputs, and improve output in real time. That behavior does not translate to physical systems.

In robotics, if a task changes even slightly, the system often fails. A different object, a new position, or a small variation in the environment can break what it learned.

The idea of showing a robot how to do something once and having it learn by watching is still out of reach. Research areas like imitation learning and continual learning have not solved this in real-world settings.

u/Responsible-Grass452 — 23 days ago