RL^2 - 优快云 Blog

[Original] IsaacLab Installation on Ubuntu 20.04, train a robot!

[Code] IsaacLab Installation on Ubuntu 20.04, train a robot!

2025-02-12 18:08:37 672

[Original] [Deep Learning] ANN, CNN, RNN and LSTM

Reference: ANN vs. CNN vs. RNN vs. LSTM: Understanding the Differences in Neural Networks

2025-01-10 17:56:29 370

[Original] [Transformer] Self Attention Mechanism

Self Attention Mechanism

2025-01-10 17:11:04 406

[Original] [Git Notes] Keep the file on your local machine but remove it from the repository

[Code] [Git] Keep the file on your local machine but remove it from the repository.

2025-01-10 17:08:58 158

[Original] [Transformer] The Structure of GPT, Generative Pretrained Transformer

The Structure of Generative Pretrained Transformer. Reference: The Transformer architecture of GPT models; How GPT Models Work

2025-01-10 17:01:37 513

[Original] [Transformer] Architecture Overview

But what is a GPT? Visual intro to Transformers | Deep learning, chapter 5; Visualizing transformers and attention | Talk for TNG Big Tech Day '24

2025-01-09 15:35:48 228

[Original] [Transformer] Word Embedding and Positional Encoding

2025-01-09 15:34:40 148

[Original] [Reinforcement Learning] How Proximal Policy Optimization (PPO) updates the policy

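Clipped surrogate objective: $L^{CLIP}(\theta) = \mathbb{E}_t\left[\min\left(r_t(\theta) A_t,\ \operatorname{clip}(r_t(\theta),\, 1-\epsilon,\, 1+\epsilon)\, A_t\right)\right]$. Where: $r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{old}(a_t \mid s_t)}$ is the probability ratio between the new and old policies, $A_t$ is the advantage at timestep $t$, and $\epsilon$ is the clipping parameter that bounds $r_t(\theta)$. The policy is updated by gradient ascent: $\theta_{new} = \theta_{old} + \alpha \nabla_\theta L^{CLIP}(\theta)$. Here, $\alpha$ is the learning rate. In general gradient-ascent form: $\theta_{new} = \theta_{old} + \alpha \nabla_\theta J(\theta_{old})$. Here, $\alpha$ is the learning rate and $\nabla_\theta J$ is the gradient of the objective $J$ with respect to $\theta$.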
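
As a rough illustration of the clipped objective in this excerpt, the sketch below evaluates it for a small batch in plain Python. The function names, the sample ratios and advantages, and the default `epsilon=0.2` are assumptions made for the example, not values taken from the post.

```python
def clip(value, low, high):
    """Clamp value into the closed interval [low, high]."""
    return max(low, min(value, high))


def ppo_clip_objective(ratios, advantages, epsilon=0.2):
    """Average of min(r * A, clip(r, 1 - eps, 1 + eps) * A) over the batch.

    ratios     -- probability ratios pi_theta(a|s) / pi_old(a|s), one per sample
    advantages -- advantage estimates A_t, one per sample
    epsilon    -- clipping parameter (0.2 is an assumed example value)
    """
    terms = []
    for r, a in zip(ratios, advantages):
        unclipped = r * a
        clipped = clip(r, 1.0 - epsilon, 1.0 + epsilon) * a
        # Taking the minimum makes the objective a pessimistic bound.
        terms.append(min(unclipped, clipped))
    return sum(terms) / len(terms)


# Hypothetical batch: one ratio above the clip range, one below, one inside.
example_ratios = [1.5, 0.5, 1.0]
example_advantages = [1.0, -1.0, 2.0]
print(ppo_clip_objective(example_ratios, example_advantages))
```

In the first sample the ratio is clipped to 1.2, so a larger policy change earns no extra credit; in the second the minimum keeps the more negative clipped term, so the penalty is not reduced.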

2024-12-26 05:43:49 571

[Original] [Reinforcement Learning] The log-derivative trick and its application in policy gradient

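The logarithm of a product is a sum of logarithms: $\log \prod_{i=1}^{n} g_i(x) = \sum_{i=1}^{n} \log g_i(x)$. The derivative of $\log f(x)$ with respect to $x$ is: $\frac{d}{dx} \log f(x) = \frac{f'(x)}{f(x)}$. For a product of functions of $x$: $f(x) = \prod_{i=1}^{n} g_i(x)$, taking the logarithm gives: $\log f(x) = \sum_{i=1}^{n} \log g_i(x)$, so $\frac{d}{dx} \log f(x) = \sum_{i=1}^{n} \frac{g_i'(x)}{g_i(x)}$. In policy gradient methods, the policy $\pi_\theta(a \mid s)$ gives the probability of taking action $a$ in state $s$ under parameters $\theta$, and the objective is $J(\theta) = \mathbb{E}_{\tau \sim \pi_\theta}[R(\tau)]$, where $\tau$ is a trajectory sampled from $\pi_\theta$.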
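
The identity behind the trick, that the derivative of the log of a product equals the sum of each factor's derivative divided by the factor, can be checked numerically. The sketch below does so with three made-up factors; the factors, the function names, and the step size `h` are assumptions chosen only for illustration.

```python
import math

# Hypothetical positive factors g_i(x), each paired with its derivative g_i'(x).
FACTORS = [
    (lambda x: x ** 2 + 1.0, lambda x: 2.0 * x),
    (lambda x: math.exp(x), lambda x: math.exp(x)),
    (lambda x: x + 3.0, lambda x: 1.0),
]


def log_f(x, factors=FACTORS):
    """log f(x) for f(x) = prod_i g_i(x), computed as a sum of logs."""
    return sum(math.log(g(x)) for g, _ in factors)


def dlog_f_analytic(x, factors=FACTORS):
    """d/dx log f(x) = sum_i g_i'(x) / g_i(x)."""
    return sum(dg(x) / g(x) for g, dg in factors)


def dlog_f_numeric(x, factors=FACTORS, h=1e-6):
    """Central finite-difference estimate of d/dx log f(x)."""
    return (log_f(x + h, factors) - log_f(x - h, factors)) / (2.0 * h)


x0 = 1.0
print(dlog_f_analytic(x0), dlog_f_numeric(x0))
```

The two printed values should agree to several decimal places. Policy gradient methods apply the same identity to the product of action probabilities along a trajectory.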

2024-12-25 17:58:41 686

[Original] [Reinforcement Learning] Epoch and timestep

[Code] [Reinforcement Learning] Epoch and timestep.

2024-12-20 18:22:49 450

[Original] [Reinforcement Learning] Agent, Environment, Value, Policy, Actor, Critic, Player, Model

[Code] [Reinforcement Learning] Agent, Environment, Value, Policy, Actor, Critic, Player, Model.

2024-12-20 18:17:23 714

[Original] [Git Notes] git reset options and branch history backup

[Code] [Git Notes] git reset options and branch history backup.

2024-12-18 18:15:40 1153

[Original] [Git Notes] Git push conflict resolution.

[Code] [Git Notes] Git push conflict resolution.

2024-12-18 16:27:35 946

[Original] [Installation Handbook] Uninstall pip and then install Miniforge on Ubuntu

To uninstall pip and then install Miniforge on an Ubuntu system, you can follow these steps. Keep in mind that completely removing pip can break dependencies for Python packages and is generally not recommended unless you're sure about the implications. Here’s how you ca

2024-12-17 16:05:12 725

[Original] Excellent Google Extensions

Obsidian Web Clipper

2024-12-15 05:54:18 96

[Original] [Installation Handbook] Obsidian: Sharpen your thinking.

[Code] [Installation Handbook] Obsidian: Sharpen your thinking.

2024-12-10 17:52:41 920

[Original] [GPU Driver] How to find the corresponding GPU driver version for your GPU on Ubuntu

[Code] [GPU Driver] How to find the corresponding GPU driver version for your GPU on Ubuntu.

2024-12-04 21:01:50 733

[Original] [GPU Driver] Purge the mismatched drivers and install the matching one to solve Version Mismatch

nvidia-smi

2024-12-04 20:18:25 910

[Original] [Tensor] The practical meaning of the dimensions or axes of a tensor in Python (NumPy, PyTorch)

[Code] [Tensor] The dimensions or axes of a tensor in Python (NumPy, PyTorch)

2024-11-25 18:01:08 1050

[Original] What is a Tensor?

WTF is a Tensor? What is the difference between tensors and matrixes?

2024-11-22 22:15:28 243

[Original] How to paint colors to the assets cube through .urdf

[Code] How to paint colors to the assets cube through .urdf.

2024-11-19 17:39:54 158

[Original] Another git process seems to be running in this repository, e.g. an editor opened by 'git commit'.

[Code] Another git process seems to be running in this repository, e.g. an editor opened by 'git commit'.

2024-11-19 03:39:12 170

[Original] How to test if your ROS 2 Humble is set up

2. source.

2024-11-12 22:56:10 250

[Original] [Installation Handbook] SimpleScreenRecorder, a screen recording tool for Linux

[Code] SimpleScreenRecorder, a screen recording tool for Linux.

2024-11-11 22:04:04 428

[Original] English grammar: conditionals and subjunctive mood

Conditional or hypothetical situations are commonly expressed in English using conditional sentences, which are generally categorized into three main types based on their likelihood or time reference. Here’s an overview of each type with examples: Used for

2024-11-06 17:39:12 999

[Original] [Installation Handbook] How to install a .tar.gz file on Ubuntu

To install software from a .tar.gz file, follow these general steps: Download and extract the file: This command extracts the contents into a directory named after the file. Navigate to the extracted directory: Read any provided documentation (usually a or file) f
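
The extraction step can also be done from Python with the standard-library `tarfile` module. This is only a sketch of that one step, not the post's shell commands; the helper name `extract_tar_gz` and the example paths are assumptions.

```python
import tarfile
from pathlib import Path


def extract_tar_gz(archive_path, dest_dir):
    """Extract a .tar.gz archive into dest_dir and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as tar:
        names = tar.getnames()
        if hasattr(tarfile, "data_filter"):
            # Newer Python versions offer extraction filters that reject
            # unsafe members such as absolute paths or links outside dest.
            tar.extractall(path=dest, filter="data")
        else:
            # Older versions have no filter; only use trusted archives here.
            tar.extractall(path=dest)
    return names


# Hypothetical usage (paths are placeholders):
# members = extract_tar_gz("software.tar.gz", "software")
# print(members[:5])
```

After extraction, the remaining steps (reading the bundled documentation and running whatever build or install procedure it describes) depend on the specific package.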

2024-10-29 00:19:35 413

[Original] ROS Noetic rosdep init/update

[Code] ROS Noetic rosdep init/update.

2024-10-28 16:26:54 107

[Original] How to verify your CUDA and PyTorch

[Code] How to verify your CUDA and PyTorch.

2024-10-22 19:19:11 333

[Original] Getting started in Reinforcement Learning

(5)

2024-10-10 16:13:34 225

[Original] [Installation Handbook] Install STM32CubeIDE on Ubuntu

1. Download the installer from the official website: https://www.st.com/en/development-tools/stm32cubeide.html
2. Navigate to the Download Directory: If the file is in your Downloads folder, run:
3. Unzip the File: Use the command to extract the contents of t

2024-10-10 16:05:36 651

[Original] [GPU Driver] [Installation Handbook] Install GPU Driver Manually for NVIDIA GPUs on Ubuntu

[Code] Install GPU Driver Manually for NVIDIA GPUs.

2024-10-10 15:52:18 661

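[Original] Summary of German Grammar

die langsame Frau (nominative), die langsame Frau (accusative), der langsamen Frau (dative), der langsamen Frau (genitive). Personal pronouns in German stand in for specific people or things, like the English I, you, he, she, it, we, they. die schöne Frau (nominative), die schöne Frau (accusative), der schönen Frau (dative), der schönen Frau (genitive)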

2024-07-15 11:22:34 3051

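[Original] [Reinforcement Learning] Important Concepts in Reinforcement Learning

The discount factor is a key parameter in reinforcement learning. By adjusting it, the agent can balance short-term gains against long-term returns and so arrive at an optimal policy. Choosing an appropriate discount factor is crucial to the effectiveness and stability of reinforcement learning algorithms.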
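
To make the short-term versus long-term trade-off concrete, the sketch below computes the discounted return of one reward sequence under several discount factors. The function name, the reward sequence, and the gamma values are assumptions chosen for illustration.

```python
def discounted_return(rewards, gamma):
    """Return sum_k gamma**k * rewards[k], accumulated from the last step back."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Hypothetical episode: small rewards early, a large reward at the end.
example_rewards = [1.0, 1.0, 1.0, 10.0]

for gamma in (0.0, 0.5, 0.99):
    print(gamma, discounted_return(example_rewards, gamma))
```

With `gamma = 0.0` only the immediate reward counts, while values close to 1 let the large final reward dominate the return.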

2024-06-24 16:57:24 1070

[Original] Future Vision of AI and Robotics (Viewpoints in AI and Robot)

Andrew.

2024-06-20 20:23:03 1796

[Original] [English Learning] Awesome English Learning Resources (updated from time to time)

https://zhuanlan.zhihu.com/p/366645568

2024-06-18 10:50:28 401

[Original] Isaac Sim

Learning Isaac Sim.

2024-06-18 10:49:38 432

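[Original] [Introduction to Deep Learning: Theory and Implementation with Python, e-book and code for each chapter] Fixing the inaccessible MNIST handwritten digit dataset

The four files have been placed under dataset/ at https://github.com/ZhangXinNan/deep_learning_from_scratch. For further verification, run other programs that access the dataset and call functions in dataset/mnist.py, for example ch03/mnist_show.py. To test whether the modified code processes the MNIST dataset correctly, run dataset/mnist.py. , but the data cannot be accessed directly and the following error is shown, so running the code also fails. code, but the code needs to access the dataset.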

2024-06-11 17:17:59 2786 7

[Original] Common conda commands

Below are some commonly used conda commands.

2024-06-07 15:08:06 401

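[Original] [Jihong Zhu (朱吉鸿) Seminar Notes] Robotic Manipulation of Deformable Objects: Opportunities and Challenges. A conversation with Jihong Zhu, associate editor for the top robotics conferences and journals RA-L, IROS and ICRA

A6: That is a good question. I am not sure I can answer it, but my own understanding is this: first, my view is that robotics is not a science; it is more a matter of engineering. What I build has to create value in people's lives. I think applications matter a great deal: automating certain processes so that people find the robot useful. My view is that large models can handle higher-level work such as planning. A large model plans the steps: you tell the robot what to do, and the large model can break the task into smaller steps, but each small step still needs low-level algorithms to carry it out on the execution side. A2: Yes, that is a very good question.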

2024-06-05 15:59:19 553

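[Original] Applications of Large Language Models (LLMs): exploring the capabilities of state-of-the-art large language models and their potential and existing industrial applications (continuously updated)

This has raised awareness of the potential of text generation across many industries, from manufacturing to software, where generating content such as articles, news, marketing slogans, social posts, meeting minutes, customer feedback or product descriptions is an integral part of daily work. Click the interactive environment below to browse different industries and find examples relevant to your interests. Luminous is a family of large language models developed by Aleph Alpha that takes a different approach: its base and extended versions can read, interpret and generate text based on existing images, rather than generating images from text. In particular, the latest transformer-based models show significant improvements in generating realistic images with a wide range of content, styles and variations from text prompts.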

2024-05-27 20:32:06 682

Common conda commands: conda cheat sheet
