![]()
刘云龙

-
副教授
- 入职时间:2009-10-15
- 所在单位:航空航天学院
- 学历:博士研究生毕业
- 性别:男
- 学位:工学博士学位
- 在职信息:在职
访问量:
-
[11]基于循环卷积神经网络的POMDP值迭代算法.计算机工程,2020,6.
-
[12]刘云龙,预测状态表示模型的复位算法.计算机学报,(5):222-227.
-
[13]刘云龙,基于CMAC网络Sarsa(λ)学习的RoboCup守门员策略.北京工业大学学报,(9):74-78.
-
[14]刘云龙.Balancing exploration and exploitation in episodic reinforcement learning.Expert Systems with Applications,2023,231
-
[15]刘云龙,Sequential Decision Making with "Sequential Information" in Deep Reinforcement Learning.Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),13631 LNCS173-184.
-
[16]刘云龙,The treatment of sepsis: an episodic memory-assisted deep reinforcement learning approach.APPLIED INTELLIGENCE,2021,
-
[17]刘云龙.Hard Negative Sample Mining for Contrastive Representation in Reinforcement Learning.Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),13281 LNAI277-288.
-
[18]刘云龙.Deep Q-Network with Predictive State Models in Partially Observable Domains.Mathematical Problems in Engineering,2020
-
[19]刘云龙,Attention-based deep Q-network in complex systems.Communications in Computer and Information Science,1142 CCIS323-332.
-
[20]刘云龙,An improved relief feature selection algorithm based on Monte-Carlo tree search.Systems Science and Control Engineering,2019,7(1):304-310.