References

This page contains all citations used throughout the Humanoid Robotics Book in APA 7th edition format.

Official Documentation

ROS 2

Open Robotics. (2024). ROS 2 Documentation: Humble Hawksbill. https://docs.ros.org/en/humble/

Gazebo & Unity

Open Robotics. (2024). Gazebo Documentation. https://gazebosim.org/docs

Unity Technologies. (2024). Unity Robotics Hub. https://github.com/Unity-Technologies/Unity-Robotics-Hub

NVIDIA Isaac

NVIDIA. (2024). Isaac Sim Documentation. https://docs.omniverse.nvidia.com/isaacsim/latest/

NVIDIA. (2024). Isaac ROS Documentation. https://nvidia-isaac-ros.github.io/

Open Robotics. (2024). Nav2 Documentation. https://nav2.org/

OpenAI Whisper

OpenAI. (2024). Whisper [Software]. GitHub. https://github.com/openai/whisper

Academic Papers

Speech Recognition

Radford, A., Kim, J. W., Xu, T., Brockman, G., McLeavey, C., & Sutskever, I. (2022). Robust Speech Recognition via Large-Scale Weak Supervision. arXiv preprint arXiv:2212.04356. https://arxiv.org/abs/2212.04356

Vision-Language-Action Models

Ahn, M., Brohan, A., Brown, N., Chebotar, Y., Cortes, O., David, B., Finn, C., Gopalakrishnan, K., Hausman, K., Herzog, A., Ho, D., Hsu, J., Ibarz, J., Ichter, B., Irpan, A., Jang, E., Ruano, R. J., Jeffrey, K., Jesmonth, S., ... Zeng, A. (2022). Do As I Can, Not As I Say: Grounding Language in Robotic Affordances. arXiv preprint arXiv:2204.01691. https://arxiv.org/abs/2204.01691

Driess, D., Xia, F., Sajjadi, M. S. M., Lynch, C., Chowdhery, A., Ichter, B., Wahid, A., Tompson, J., Vuong, Q., Yu, T., Huang, W., Chebotar, Y., Sermanet, P., Duckworth, D., Levine, S., Vanhoucke, V., Hausman, K., Toussaint, M., Greff, K., ... Florence, P. (2023). PaLM-E: An Embodied Multimodal Language Model. arXiv preprint arXiv:2303.03378. https://arxiv.org/abs/2303.03378

Vision-Language Models

Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning Transferable Visual Models From Natural Language Supervision. arXiv preprint arXiv:2103.00020. https://arxiv.org/abs/2103.00020

Note: All citations follow APA 7th edition format. URLs were verified as of the date of publication.

Official Documentation​

ROS 2​

Gazebo & Unity​

NVIDIA Isaac​

Navigation​

OpenAI Whisper​

Academic Papers​

Speech Recognition​

Vision-Language-Action Models​

Vision-Language Models​