Search — thor

14 results · 9 issues · 5 papers · 0 companies

Issues

9 matches

Robot Wizard Add Colliders Buutton ont show in the windowPain
nvidia-forum:simulation5/14/2026tooling-dx
User reports the Robot Wizard “Add Colliders” button does not show in the window. This likely blocks or slows collider generation during robot import/setup.
isaac-simrobot-wizardcollidersuiusdauthoring
The USB microphone cannot record properly on the Jetson Thor platformPain
nvidia-forum:robotics-edge-computing5/13/2026hardware-integration
A USB microphone reportedly cannot record properly on the Jetson Thor platform. No additional details are provided in the post.
jetsonthorusb-audiomicrophonemultimodal
[BUG] Error issued when adding a D6 joint with 1 angular dof and 1 linear dof to SolverMuJoCoPain
github:newton-physics/newton5/12/2026other
Adding a D6 joint with 1 angular DOF and 1 linear DOF to SolverMuJoCo can produce a repeated joint name error because the name-amending logic doesn't change names in this case. This causes model build failure due to name collisions.
newton
AGX Thor / JetPack 7.1 - DCE_MBOX_TO_DCE_RM left dirty by UEFI causing RmInitAdapter failure on specific unitsBlocker
nvidia-forum:robotics-edge-computing5/12/2026crashes-stability
On AGX Thor / JetPack 7.1, UEFI leaves DCE mailbox state dirty, causing RmInitAdapter failure on specific units. This can prevent GPU driver initialization and system usability.
jetsonthorjetpack-7-1uefirminitadaptergpu-init
NVIDIA Jetson Thor Developer Kit同时使用两条SSD的可能性Friction
nvidia-forum:robotics-edge-computing5/12/2026hardware-integration
User asks about the possibility of using two SSDs simultaneously on the NVIDIA Jetson Thor Developer Kit. This is an integration capability question.
jetsonthorssdstoragepcie
Having a hard time booting from Thor IGX MiniBlocker
nvidia-forum:robotics-edge-computing5/12/2026hardware-integration
User reports difficulty booting from Thor IGX Mini. This blocks platform setup and testing.
jetsonthorigx-minibootstorage-boot
Why were the kernel and dtb partitions removed on Thor?Friction
nvidia-forum:robotics-edge-computing5/11/2026docs-onboarding
User asks why kernel and dtb partitions were removed on Thor. This indicates confusion about platform boot/partition layout changes.
jetsonthorbootpartitionsdtbkernel
[Bug Report] CloudXR JS / Runtime — inbound message channel and hand-tracking poses silently dropped between client and Isaac Sim's OpenXR pluginPain
github:isaac-sim/IsaacLab5/11/2026rendering
In a CloudXR + OpenXR setup, frames stream correctly but inbound messages and hand-tracking poses are silently dropped between client and Isaac Sim’s OpenXR plugin. This blocks teleop commands and hand tracking for interactive workflows.
renderinghardwaredeploymentintegrationisaac-simisaac-lab
Planar mesh geom fails to compile with Qhull errorPain
github:google-deepmind/mujoco5/6/2026crashes-stability
crashusdmujoconewton

Papers

5 matches

Articraft: An Agentic System for Scalable Articulated 3D Asset Generation
2605.151875/14/2026Matt Zhou, Ruining Li, Xiaoyang Lyu, Zhaomou Song …
A bottleneck in learning to understand articulated 3D objects is the lack of large and diverse datasets. In this paper, we propose to leverage large language models (LLMs) to close this gap and generate articulated assets at scale. We reduce the problem of generating an articulated 3D asset to that of writing a program that builds it. We then introduce a new agentic system, Articraft, that writes such programs automatically. We design a programmatic interface and harness to help the LLM do so effectively. The LLM writes code against a domain-specific SDK for defining parts, composing geometry, specifying joints, and writing tests to validate the resulting assets. The harness exposes a restricted workspace and interface to the LLM, validates the resulting assets, and returns structured feedback. In this way, the LLM is not distracted by details such as authoring a URDF file or managing a complex software environment. We show that this produces higher-quality assets than both state-of-the-art articulated-asset generators and general-purpose coding agents. Using Articraft, we build Articraft-10K, a curated dataset of over 10K articulated assets spanning 245 categories, and show its utility both for training models of articulated assets and in downstream applications such as robotics simulation and virtual reality.
usd
SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy
2605.122475/12/2026Yibo Liu, Stanko Oparnica, Simon Shewchun-Jakaitis, Guoyi Fu …
Contact-rich assembly is fundamental in robotics but poses significant challenges due to uncertainties in relative poses, such as misalignments and small clearances in peg-in-hole tasks. Existing approaches typically address search and high-precision insertion separately, because these tasks involve distinct action patterns. However, supporting both tasks within a single model, without switching models or weights, is desirable for intelligent assembly systems. In this work, we propose SI-Diff, a framework that learns both search and high-precision insertion through a force-domain diffusion policy. To this end, we introduce a new mode-conditioning mechanism that enables the policy to capture distinct action behaviors under a single framework. Moreover, we develop a new search teacher policy that can generate diverse trajectories. By training on successful and efficient demonstrations provided by the teacher policy, the model learns the mapping from tactile and end-effector velocity observations to effective action behaviors. We conduct thorough experiments to show that SI-Diff extends the tolerance to x-y misalignments from 2 mm to 5 mm compared to the state-of-the-art baseline, TacDiffusion, while also demonstrating strong zero-shot transferability to unseen shapes.
rlsensors
Control of Fully Actuated Aerial Vehicles: A Comparison of Model-based and Sensor-based Dynamic Inversion
2605.120715/12/2026Ali Sidar Yilmaz, Buday Turan, Lukas Pries, Markus Ryll
Fully actuated multirotor platforms decouple translational force generation from vehicle attitude, enabling independent control of position and orientation and shifting performance limitations from attitude authority to actuator dynamics and control effectiveness. This paper compares a model-based nonlinear dynamic inversion controller (geometric NDI) with a sensor-based incremental dynamic inversion controller (INDI) on a fixed-tilt fully actuated hexarotor. Both controllers share an identical outer-loop structure and are both executed at 500 Hz; therefore, performance differences can be attributed primarily to the inversion strategy. Controller performance is evaluated in five experiments covering attitude step tracking under nominal conditions and under a 50% mismatch in the rotor force coefficient, hover disturbance rejection under an external lateral load, waypoint tracking in the presence of wind gust disturbances, reduced control frequency, and injected sensor degradation. The results show that INDI offers clear advantages under parameter mismatch, gust disturbances, and sensor degradation, and maintains lower position errors across the controller-frequency sweep. However, its advantages are not universal: geometric NDI yields better attitude tracking at reduced control frequencies. To the authors' best knowledge, this work presents the first experimental validation of a full pose tracking INDI controller with decoupled translational and rotational dynamics. These findings highlight the trade-off between measurement-based and model-based inversion for robust control and rapid deployment of fully actuated UAVs.
deployment
RoboBlockly Studio: Conversational Block Programming with Embodied Robot Feedback for Computational Thinking
2605.120595/12/2026Leyi Li, Chenyu Du, Jiafei Sun, Erick Purwanto …
Computational thinking (CT) is increasingly promoted as a core literacy, yet learners and teachers face challenges in connecting abstract program logic to meaningful outcomes. We design and evaluate RoboBlockly Studio, an integrated interactive system that combines block-based programming, a conversational AI teaching agent, and embodied robot execution. RoboBlockly Studio creates a tight iterative loop of authoring, running, observing, and revising. Informed by interviews with five programming teachers, the system was designed to support four goals: (1) preserving learner agency in computational thinking, (2) making program behavior transparent and interpretable, (3) grounding programming in embodied, classroom-aligned tasks, and (4) scaffolding reflection through pedagogically grounded AI dialogue. We deployed RoboBlockly Studio with 32 high school students, observing how robot and AI feedback influenced students' interactions with code, reflections on problem-solving strategies, and understanding of CT concepts. We discuss design insights and implications for creating interactive, embodied learning environments that integrate AI and robotics to support CT learning in computing education.
integration
MobileEgo Anywhere: Open Infrastructure for long horizon egocentric data on commodity hardware
2605.059455/7/2026Senthil Palanisamy, Abhishek Anand, Satpal Singh Rathor, Pratyush Patnaik …
The recent advancement of Vision Language Action (VLA) models has driven a critical demand for large scale egocentric datasets. However, existing datasets are often limited by short episode durations, typically spanning only a few minutes, which fails to capture the long horizon temporal dependencies necessary for complex robotic task execution. To bridge this gap, we present MobileEgo Anywhere, a framework designed to facilitate the collection of robust, hour plus egocentric trajectories using commodity mobile hardware. We leverage the ubiquitous sensor suites of modern smartphones to provide high fidelity, long term camera pose tracking, effectively removing the high hardware barriers associated with traditional robotics data collection. Our contributions are three fold: (1) we release a novel dataset comprising 200 hours of diverse, long form egocentric data with persistent state tracking; (2) we open source a mobile application that enables any user to record egocentric data, and (3) we provide a comprehensive processing pipeline to convert raw mobile captures into standardized, training ready formats for Vision Language Action model and foundation model research. By democratizing the data collection process, this work enables the massive scale acquisition of long horizon data across varied global environments, accelerating the development of generalizable robotic policies.
sensorsfoundation-modelvla

Tip: search matches titles, descriptions, tags, categories, authors and source IDs. Back to overview