LLM在自动驾驶中的Survey

作者 | erkang  编辑 | 自动驾驶之心

原文链接:https://zhuanlan.zhihu.com/p/707464289

点击下方卡片,关注“自动驾驶之心”公众号

戳我-> 领取自动驾驶近15个方向学习路线

>>点击进入→自动驾驶之心大语言模型技术交流群

本文只做学术分享,如有侵权,联系删文

做了粗略的LLM在自动驾驶中的survey

  1. 汇总了相关的paper list

  2. 方便快速了解信息,摘取了abstract,图-问题描述,图-方法overview

  3. TODO:归纳分类 & 极简总结

整体感觉,目前的LLM for Autonomous Driving虽然很火,在三维世界理解下的reasoning还是大有可为,尤其是为端到端服务。

Sec-1: Paper list

[1] LMDrive: Closed-loop End-to-End Driving with Large language Models
[2] Driving Everywhere with Large Language Model Policy Adaptation
[3] ADAPT: Action-aware Driving Caption Transformer
[4] Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs
[5] DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experience
[6] Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Model
[7] DriveGPT4: Interpretable End-to-end Autonomous Driving via Large Language Model
[8] ADriver-I: A General World Model for Autonomous Driving
[9] DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving
[10] Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving
[11] Embodied Understanding of Driving Scenarios
[12] Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models
[13] OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning
[14] Dolphins: Multimodal Language Model for Driving
[15] HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving
[16] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
[17] RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
[18 ]AD-H: Autonomous Driving with Hierarchical Agents
[19] Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving
[20] LingoQA: Video Question Answering for Autonomous Driving
[21] 3D Dense Captioning beyond Nouns: A Middleware for Autonomous Driving
[22] Probing Multimodal LLMs as World Models for Driving
[23] LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving
[24] LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding|
[25] DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving

Sec-2: Paper汇总

包含Title, Venue, Year, Institute, Abstract, Fig-Problem-define, and Fig-Method-Overview

36d6a929c3d41763ebe022a9e52d2e13.png 27b766ac4a540148442b66a9d5cc2860.png 324af9ee4817b5a34d9dc44bfb2f25f7.png a5bb7e751e56ee64df433109a0463c1e.png 6c1e8dc04fddcd29addde6775026ec57.png 1d16b31c0ea81bd3680a368d19e18578.png 08ceece866c7e760abeab811df672471.png ecfc33cce587a4c8c978d68c163fd6cd.png 4b7fe1f3e1aa9d863ea28ee74a52de87.png a2c8e7d9b4f7fab1fb12e51b2b981f28.png d2af6f9b5dcbaba1ae5f0231c271c878.png 1eeb266b32e2a6c7594c32471255be87.png 1a93275a38c6ff338f197726d6aab756.png 5d28103a22d2af775f69fb5375d2e187.png 66393d010faa0e73b8c22f1cb80ed0f1.png 150a458a85f4dd184d38b299d050204a.png ea2eaf7ad213e59f6fc7171f253e4608.png 323547f6e7c991c62588d6b8b9dc7a91.png 89d2381ea4af18952a4cf7a3c6e908a7.png c1b9579ab4d053c3df5098d623cdd2de.png 4b386f8cf693aff984372e6f068bfa6b.png 225038225ed0d4a68489408e8600ef92.png 262e1b3363172603ce5b85c2e0fd3e30.png 5a77140411bec190f58c5024a46827ce.png 6ebad2fb5d72ccf5d5a3cbfbd5eff795.png

投稿作者为『自动驾驶之心知识星球』特邀嘉宾,欢迎加入交流!

① 全网独家视频课程

BEV感知、BEV模型部署、BEV目标跟踪、毫米波雷达视觉融合多传感器标定多传感器融合多模态3D目标检测车道线检测轨迹预测在线高精地图世界模型点云3D目标检测目标跟踪Occupancy、cuda与TensorRT模型部署大模型与自动驾驶Nerf语义分割自动驾驶仿真、传感器部署、决策规划、轨迹预测等多个方向学习视频(扫码即可学习

e9fb4656395bc9a2aba1027885bcafdc.png

网页端官网:www.zdjszx.com

② 国内首个自动驾驶学习社区

国内最大最专业,近3000人的交流社区,已得到大多数自动驾驶公司的认可!涉及30+自动驾驶技术栈学习路线,从0到一带你入门自动驾驶感知2D/3D检测、语义分割、车道线、BEV感知、Occupancy、多传感器融合、多传感器标定、目标跟踪)、自动驾驶定位建图SLAM、高精地图、局部在线地图)、自动驾驶规划控制/轨迹预测等领域技术方案大模型、端到端等,更有行业动态和岗位发布!欢迎扫描下方二维码,加入自动驾驶之心知识星球,这是一个真正有干货的地方,与领域大佬交流入门、学习、工作、跳槽上的各类难题,日常分享论文+代码+视频

3c201785c9dc54c75f5a3946dd1d9c38.png

③【自动驾驶之心】技术交流群

自动驾驶之心是首个自动驾驶开发者社区,聚焦感知、定位、融合、规控、标定、端到端、仿真、产品经理、自动驾驶开发、自动标注与数据闭环多个方向,目前近60+技术交流群,欢迎加入!扫码添加汽车人助理微信邀请入群,备注:学校/公司+方向+昵称(快速入群方式)

1d86d24d4a7d55eb3b1b2a20fa944d4c.jpeg

④【自动驾驶之心】全平台矩阵

399b92900bc99e3d3086dda3befc2ab5.png

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值