Original: IsaacLab Installation on Ubuntu 20.04, train a robot!
[Code] IsaacLab Installation on Ubuntu 20.04, train a robot!
2025-02-12 18:08:37
672
Original: [Deep Learning] ANN, CNN, RNN and LSTM
Reference: ANN vs. CNN vs. RNN vs. LSTM: Understanding the Differences in Neural Networks
2025-01-10 17:56:29
370
Original: [Git Notes] Keep the file on your local machine but remove it from the repository
[Code] [Git] Keep the file on your local machine but remove it from the repository.
2025-01-10 17:08:58
158
Original: [Transformer] The Structure of GPT, Generative Pretrained Transformer
The Structure of Generative Pretrained Transformer. References: The Transformer architecture of GPT models; How GPT Models Work
2025-01-10 17:01:37
513
Original: [Transformer] Architecture Overview
But what is a GPT? Visual intro to Transformers | Deep learning, chapter 5; Visualizing transformers and attention | Talk for TNG Big Tech Day '24
2025-01-09 15:35:48
228
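Original: [Reinforcement Learning] How Proximal Policy Optimization (PPO) updates the policy
$L^{CLIP}(\theta) = \mathbb{E}_t\left[\min\left(r_t(\theta) A_t,\ \mathrm{clip}(r_t(\theta), 1-\epsilon, 1+\epsilon)\, A_t\right)\right]$. Where: $r_t(\theta) = \dfrac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{old}}(a_t \mid s_t)}$ is the probability ratio, $A_t$ is the advantage at timestep $t$, and $\epsilon$ bounds how far $r_t(\theta)$ may move from 1. The update is $\theta_{new} = \theta_{old} + \alpha \nabla_\theta L^{CLIP}(\theta)$. Here, $\alpha$ is the step size, and $r_t(\theta)$, $\theta_{old}$ and $A_t$ enter through $L^{CLIP}$. Compare the plain policy gradient update $\theta_{new} = \theta_{old} + \alpha \nabla_\theta J(\theta_{old})$. Here, $\alpha$ is the step size, $\theta$ the policy parameters, and $J$ the objective.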
2024-12-26 05:43:49
571
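The clipped objective in the PPO entry above can be sketched in a few lines of plain Python. This is a minimal illustration, not the post's own code: the function names `clip` and `ppo_clip_objective`, the default `epsilon=0.2`, and the sample ratios and advantages are all assumptions chosen for the example.

```python
def clip(x, lo, hi):
    """Clamp x into the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def ppo_clip_objective(ratios, advantages, epsilon=0.2):
    """Average of min(r_t * A_t, clip(r_t, 1 - eps, 1 + eps) * A_t) over timesteps.

    ratios[t]     -- r_t(theta) = pi_theta(a_t|s_t) / pi_theta_old(a_t|s_t)
    advantages[t] -- advantage estimate A_t
    epsilon       -- clipping range (0.2 is an assumed, illustrative value)
    """
    terms = []
    for r, a in zip(ratios, advantages):
        unclipped = r * a
        clipped = clip(r, 1.0 - epsilon, 1.0 + epsilon) * a
        # Taking the minimum makes the objective a pessimistic bound,
        # so large moves away from the old policy earn no extra credit.
        terms.append(min(unclipped, clipped))
    return sum(terms) / len(terms)


# Hypothetical batch: one ratio above the clip range, one below it.
example_value = ppo_clip_objective([1.5, 0.5], [1.0, -1.0], epsilon=0.2)
print(example_value)
```

With a positive advantage the ratio 1.5 is capped at 1.2, and with a negative advantage the ratio 0.5 is raised to 0.8, so neither sample rewards moving further from the old policy.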
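Original: [Reinforcement Learning] The log-derivative trick and its application in policy gradient
$\log \prod_{i=1}^{n} g_i(x) = \sum_{i=1}^{n} \log g_i(x)$. The derivative of $\log f(x)$ with respect to $x$ is: $\dfrac{d}{dx} \log f(x) = \dfrac{f'(x)}{f(x)}$. For a product of functions of $x$: $f(x) = \prod_{i=1}^{n} g_i(x)$, taking the logarithm gives: $\log f(x) = \sum_{i=1}^{n} \log g_i(x)$, so $\dfrac{d}{dx} \log f(x) = \sum_{i=1}^{n} \dfrac{g_i'(x)}{g_i(x)}$. In policy gradient, $\pi_\theta(a \mid s)$ is the probability of taking action $a$ in state $s$ under parameters $\theta$, and the objective is $J(\theta) = \mathbb{E}_{\tau \sim \pi_\theta}[R(\tau)]$, where $\tau$ is a trajectory sampled from the policy with parameters $\theta$.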
2024-12-25 17:58:41
686
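The product form of the log-derivative identity in the entry above can be checked numerically. The sketch below is only an illustration under stated assumptions: the three factors $g_1(x) = x$, $g_2(x) = x^2 + 1$, $g_3(x) = e^x$ and the function names are hypothetical, picked because their derivatives are easy to write down. It needs Python 3.8 or later for `math.prod`.

```python
import math


def g_factors(x):
    """Hypothetical factors g_1(x) = x, g_2(x) = x^2 + 1, g_3(x) = exp(x)."""
    return [x, x * x + 1.0, math.exp(x)]


def g_derivatives(x):
    """Derivatives of the three factors above, in the same order."""
    return [1.0, 2.0 * x, math.exp(x)]


def dlog_product_analytic(x):
    """d/dx log f(x) computed as the sum of g_i'(x) / g_i(x)."""
    return sum(dg / g for g, dg in zip(g_factors(x), g_derivatives(x)))


def dlog_product_numeric(x, h=1e-6):
    """Central finite difference of log f(x), with f the product of the factors."""
    def log_f(t):
        return math.log(math.prod(g_factors(t)))
    return (log_f(x + h) - log_f(x - h)) / (2.0 * h)


x0 = 2.0
print(dlog_product_analytic(x0), dlog_product_numeric(x0))
```

At $x = 2$ the analytic sum is $1/2 + 4/5 + 1$, and the finite-difference estimate should agree with it to several decimal places.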
Original: [Reinforcement Learning] Epoch and timestep
[Code] [Reinforcement Learning] Epoch and timestep.
2024-12-20 18:22:49
450
Original: [Reinforcement Learning] Agent, Environment, Value, Policy, Actor, Critic, Player, Model
[Code] [Reinforcement Learning] Agent, Environment, Value, Policy, Actor, Critic, Player, Model.
2024-12-20 18:17:23
714
Original: [Git Notes] git reset options and branch history backup
[Code] [Git Notes] git reset options and branch history backup.
2024-12-18 18:15:40
1153
Original: [Git Notes] Git push conflict resolution.
[Code] [Git Notes] Git push conflict resolution.
2024-12-18 16:27:35
946
Original: [Installation Handbook] Uninstall pip and then install Miniforge on Ubuntu
To uninstall pip and then install Miniforge on an Ubuntu system, you can follow these steps. Keep in mind that completely removing pip can break dependencies for Python packages and is generally not recommended unless you're sure about the implications. Here’s how you ca
2024-12-17 16:05:12
725
Original: [Installation Handbook] Obsidian--Sharpen your thinking.
[Code] [Installation Handbook] Obsidian--Sharpen your thinking.
2024-12-10 17:52:41
920
Original: [GPU Driver] How to find the corresponding GPU driver version for your GPU on Ubuntu
[Code] [GPU Driver] How to find the corresponding GPU driver version for your GPU on Ubuntu.
2024-12-04 21:01:50
733
Original: [GPU Driver] Purge the mismatched drivers and install the matching one to resolve a version mismatch
nvidia-smi.
2024-12-04 20:18:25
910
Original: [Tensor] The practical meaning of the dimensions or axes of a tensor in Python (NumPy, PyTorch)
[Code] [Tensor] The dimensions or axes of a tensor in Python (NumPy, PyTorch)
2024-11-25 18:01:08
1050
Original: What is a Tensor?
WTF is a Tensor? What is the difference between tensors and matrices?
2024-11-22 22:15:28
243
Original: How to apply colors to the asset cube through .urdf
[Code] How to apply colors to the asset cube through .urdf.
2024-11-19 17:39:54
158
Original: Another git process seems to be running in this repository, e.g. an editor opened by 'git commit'.
[Code] Another git process seems to be running in this repository, e.g. an editor opened by 'git commit'.
2024-11-19 03:39:12
170
Original: [Installation Handbook] SimpleScreenRecorder, a screen recording tool for Linux
[Code] SimpleScreenRecorder, a screen recording tool for Linux.
2024-11-11 22:04:04
428
Original: English grammar: conditionals and subjunctive mood
Conditional or hypothetical situations are commonly expressed in English using conditional sentences, which are generally categorized into three main types based on their likelihood or time reference. Here’s an overview of each type with examples: Used for
2024-11-06 17:39:12
999
Original: [Installation Handbook] How to install a .tar.gz file on Ubuntu
To install software from a .tar.gz file, follow these general steps: Download and extract the file: This command extracts the contents into a directory named after the file. Navigate to the extracted directory: Read any provided documentation (usually a or file) f
2024-10-29 00:19:35
413
Original: How to verify your CUDA and PyTorch
[Code] How to verify your CUDA and PyTorch.
2024-10-22 19:19:11
333
Original: [Installation Handbook] Install STM32CubeIDE on Ubuntu
1. Download the installer from the official website: https://www.st.com/en/development-tools/stm32cubeide.html 2. Navigate to the Download Directory: If the file is in your Downloads folder, run: 3. Unzip the File: Use the command to extract the contents of t
2024-10-10 16:05:36
651
Original: [GPU Driver] [Installation Handbook] Install GPU Driver Manually for NVIDIA GPUs on Ubuntu
[Code] Install GPU Driver Manually for NVIDIA GPUs.
2024-10-10 15:52:18
661
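Original: German grammar summary
die langsame Frau (nominative), die langsame Frau (accusative), der langsamen Frau (dative), der langsamen Frau (genitive). Personal pronouns in German stand in for specific people or things, like the English I, you, he, she, it, we, they. die schöne Frau (nominative), die schöne Frau (accusative), der schönen Frau (dative), der schönen Frau (genitive)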
2024-07-15 11:22:34
3051
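Original: [Reinforcement Learning] Important concepts in reinforcement learning
The discount factor is a key parameter in reinforcement learning. By adjusting it, the agent can balance short-term gains against long-term returns and so arrive at an optimal policy. Choosing an appropriate discount factor is crucial to the effectiveness and stability of a reinforcement learning algorithm.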
2024-06-24 16:57:24
1070
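The short-term versus long-term trade-off described in the discount factor entry above can be made concrete with a small calculation. This is a minimal sketch: the function name `discounted_return` and the two reward sequences are hypothetical, chosen only to show how the ranking of two options flips as the discount factor changes.

```python
def discounted_return(rewards, gamma):
    """Return sum over t of gamma**t * rewards[t], accumulated back to front."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Hypothetical reward sequences: a small reward now versus a large one later.
immediate = [1.0, 0.0, 0.0]
delayed = [0.0, 0.0, 10.0]

for gamma in (0.1, 0.9):
    print(gamma, discounted_return(immediate, gamma), discounted_return(delayed, gamma))
```

With a small discount factor the immediate reward yields the larger return, while a discount factor close to 1 favors the delayed reward.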
Original: [English Learning] Awesome English Learning Resources (updated irregularly)
https://zhuanlan.zhihu.com/p/366645568
2024-06-18 10:50:28
401
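Original: [Deep Learning from Scratch: theory and implementation in Python, e-book and per-chapter code] Fixing the inaccessible MNIST handwritten-digit dataset
The four files have been placed under dataset/ at https://github.com/ZhangXinNan/deep_learning_from_scratch. To verify further, run another program that needs to access dataset and calls functions in dataset/mnist.py, for example ch03/mnist_show.py. To test whether the modified code processes the MNIST dataset correctly, run dataset/mnist.py. … but the data cannot be accessed directly and the following error is reported, so running the code also fails. … code, but the code needs to access the dataset.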
2024-06-11 17:17:59
2786
7
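Original: [Seminar notes: 朱吉鸿 (Zhu Jihong)] Robotic manipulation of deformable objects: opportunities and challenges. A conversation with Zhu Jihong, associate editor for the top robotics venues RA-L, IROS and ICRA
A6: That is a good question, and I am not sure I can answer it, but my own understanding is this. First, my view is that robotics is not a science; it is more engineering. What I build has to create value in people's lives, so I think applications matter a great deal: automating certain processes so that people feel the robot is useful. My view is that large models can do higher-level work such as planning. You tell the robot what to do, and the large model can break that down into smaller steps, but each small step still needs lower-level algorithms to execute it, and the execution side carries it out. A2: Yes, that is a very good question.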
2024-06-05 15:59:19
553
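Original: Applications of Large Language Models (LLMs): exploring the capabilities of state-of-the-art LLMs and their potential and existing industrial applications (continuously updated)
This has raised awareness of the potential of text generation across many industries, from manufacturing to software, where generating content such as articles, news, marketing slogans, social posts, meeting minutes, customer feedback, or product descriptions is an integral part of daily work. Click the interactive environment below to browse different industries and find examples relevant to your interests. Luminous is a family of large language models developed by Aleph Alpha that takes a different approach: its base and extended versions can read, interpret, and generate text based on existing images, rather than generating images from text. In particular, the latest transformer-based models show significant improvements in generating realistic images with a wide range of content, styles, and variations from text prompts.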
2024-05-27 20:32:06
682