Lü Shuai, 吕帅

在读研究生


周瑞凯,男,瑶族,1992年07月生,广西壮族自治区桂林市人。

【学术论文】在国内外期刊和会议上发表学术论文1篇,在审学术论文3篇。

  1. Zhou Ruikai, Li Songlin, Lü Shuai*. From simple to complex: Mitigating the impact of critic accuracy fluctuations by multi-agent. 2025. (Submitted)
  2. Zhou Ruikai, Zhong Taihong, Li Songlin, Lü Shuai*. A Kullback-Leibler divergence perspective on policy gradient methods in reinforcement learning. 2025. (Submitted)
  3. Zhou Ruikai, Zhong Taihong, Zhu Wenbo, Han Shuai, Lü Shuai*. Influence of Gaussian distribution on performance metrics in continuous reinforcement learning: A case study. 2025. (Submitted)
  4. Zhou Ruikai, Zhu Wenbo, Han Shuai, Kang Meng, Lü Shuai*. VCSAP: Online reinforcement learning exploration method based on visitation count of state-action pairs. Neural Networks, 2025, 184: 107052. (中科院1区TOP期刊, CCF推荐B类期刊, SCI, 目前IF: 6.0)

【荣誉奖励】

【联系方式】


方文思,女,1999年01月生,吉林省公主岭市人。

【学术论文】在国内外期刊和会议上发表学术论文3篇,在审学术论文3篇。

  1. Lü Shuai, Yuan Jianhui, Zhang Xinyu, Zhang Shaojie, Fang Wensi, Li Jingyao*. Pre-trained initialization and memory-enhanced correction for source-free universal domain adaptation. 2025. (Submitted)
  2. Li Ying, Fang Wensi, Jiang Xuyang, Sun Hang, Li Linlin, Du Wei*. MGRFE-web: A web server for molecular target identification of Alzheimer’s disease based on feature selection. 2023. (Submitted)
  3. Li Ying, Fang Wensi, Zhao Jianing, Yang Xiao, Sun Hang, Du Wei*. Training an end-to-end moonlighting long non-coding RNAs deep learning model based on reinforcement learning. 2023. (Submitted)
  4. Li Ying, Sun Hang, Fang Wensi, Ma Qin, Han Siyu, Rui Wang-Sattler, Du Wei*, Yu Qiong*. SURE: Screening unlabeled samples for reliable negative samples based on reinforcement learning. Information Sciences, 2023, 629: 299-312. (中科院1区TOP期刊, CCF推荐B类期刊, SCI)
  5. Li Ying, Fang Wensi, Sun Hang, Liu Xiangyu, Du Wei, Liu Yijun, Li Qianqian*. PecidRL: Petition expectation correction and identification based on deep reinforcement learning. Information Processing and Management, 2023, 60(3): 103285. (中科院1区TOP期刊, CCF推荐B类期刊, SCI, IF: 7.4)
  6. Han Siyu, Yang Xiao, Sun Hang, Yang Hu, Zhang Qi, Peng Cheng, Fang Wensi, Li Ying*. LION: An integrated R package for effective prediction of ncRNA–protein interaction. Briefings in Bioinformatics, 2022, 23(6): bbac420. (中科院1区期刊, CCF推荐B类期刊, SCI, IF: 9.5)

【学位论文】

  1. 方文思. 基于强化学习的政府留言板标签更正与识别算法研究[硕士学位论文]. 长春: 吉林大学, 2023.

【荣誉奖励】

【联系方式】


廉筱峪,男,满族,2001年07月生,辽宁省抚顺市人。

【学术论文】在国内外期刊和会议上发表学术论文2篇,在审学术论文1篇。

  1. Lian Xiaoyu, Xia Nan*, Dai Gaole, Yang Hongqin. A dual-branch deep interaction network for multi-channel speech enhancement. 2024. (Submitted)
  2. Lian Xiaoyu, Xia Nan*, Dai Gaole, Yang Hongqin. An efficient joint training model for monaural noisy-reverberant speech recognition. Applied Acoustics, 2025, 228: 110322. (中科院2区期刊, SCI, 目前IF: 3.4)
  3. 廉筱峪, 夏楠*, 戴高乐, 杨红琴. 复杂噪声环境下基于轻量化模型的车内交互语音增强和识别方法. 电子学报, 2024, 52(4): 1282-1287. (CCF推荐中文A类期刊)

【荣誉奖励】

【联系方式】


朱文博,女,2000年08月生,吉林省长春市人。

【学术论文】在国内外期刊和会议上发表学术论文2篇,在审学术论文6篇。

  1. Zhu Wenbo, Xiao Wei, Lü Shuai*. Soft-penalty guided exploration in reinforcement learning. 2025. (Submitted)
  2. Long Zehong, Zhu Wenbo, Zhang Yushu, Lü Shuai*, Lin Dajun. Efficient exploration via state distribution discrepancy maximization in deep reinforcement learning. 2024. (Submitted)
  3. Zhu Wenbo, Lü Shuai*, Long Zehong, Wu Junhong. Feature distillation for exploration in reinforcement learning. 2023. (Submitted)
  4. Zhou Ruikai, Zhong Taihong, Zhu Wenbo, Han Shuai, Lü Shuai*. Influence of Gaussian distribution on performance metrics in continuous reinforcement learning: A case study. 2025. (Submitted)
  5. Long Zehong, Zhu Wenbo, Lü Shuai*, Wu Junhong, Zhong Taihong. Breaking the sample efficiency barrier by rethinking experience replay. 2025. (Submitted)
  6. Zhu Sheng, Wu Hao, Shen Chun, Zhu Wenbo, Han Shuai, Lü Shuai*. Actor-critic of multi-agent collaboration on single-agent task. 2025. (Submitted)
  7. Zhou Ruikai, Zhu Wenbo, Han Shuai, Kang Meng, Lü Shuai*. VCSAP: Online reinforcement learning exploration method based on visitation count of state-action pairs. Neural Networks, 2025, 184: 107052. (中科院1区TOP期刊, CCF推荐B类期刊, SCI, 目前IF: 6.0)
  8. Li Jingyao, Lü Shuai, Zhu Wenbo, Li Zhanshan*. Enhancing transferability and discriminability simultaneously for unsupervised domain adaptation. Knowledge-Based Systems, 2022, 247: 108705. (中科院1区TOP期刊, CCF推荐C类期刊, SCI, IF: 8.8)

【学位论文】

  1. 朱文博. 基于特征蒸馏和软惩罚引导的强化学习探索方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


张鑫宇,男,1999年04月生,黑龙江省齐齐哈尔市人。

【学术论文】在国内外期刊和会议上发表学术论文2篇,在审学术论文2篇。

  1. Lü Shuai, Yuan Jianhui, Zhang Xinyu, Zhang Shaojie, Fang Wensi, Li Jingyao*. Pre-trained initialization and memory-enhanced correction for source-free universal domain adaptation. 2025. (Submitted)
  2. Lü Shuai, Zhang Xinyu, Li Zongze, Li Jingyao*, Kang Meng. Bi-classifier with neighborhood aggregation for unsupervised domain adaptation. 2024. (Submitted)
  3. Zhang Xinyu, Kang Meng, Lü Shuai*. Low category uncertainty and high training potential instance learning for unsupervised domain adaptation. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024, 16881-16889. (CCF推荐A类会议)
  4. Lü Shuai, Li Zongze, Zhang Xinyu, Li Jingyao*. Consistency regularization-based mutual alignment for source-free domain adaptation. Expert Systems with Applications, 2024, 241: 122577. (中科院1区TOP期刊, CCF推荐C类期刊, SCI, 目前IF: 7.5)

【学位论文】

  1. 张鑫宇. 基于自监督学习的无监督领域自适应方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


张泽宇,男,2000年09月生,山东省滨州市人。

【学术论文】在国内外期刊和会议上发表学术论文2篇,在审学术论文1篇。

  1. Zhang Zeyu, Shen Chun, Ma Qiang, Kang Meng, Lü Shuai*. Prototype-driven active domain adaptation with density consideration. 2025. (Submitted)
  2. Zhang Zeyu, Shen Chun, Lü Shuai*, Zhang Shaojie. Reconfigurability-aware selection for contrastive active domain adaptation. In: Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju Island, South Korea, August 3-9, 2024, 5545-5553. (CCF推荐A类会议)
  3. Zhang Shaojie, Shen Chun, Lü Shuai*, Zhang Zeyu. Reviewing the forgotten classes for domain adaptation of black-box predictors. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024, 16830-16837. (CCF推荐A类会议)

【学位论文】

  1. 张泽宇. 基于主动学习的领域自适应方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


林炟君,女,2000年08月生,福建省莆田市人。

【学术论文和发明专利】在国内外期刊和会议上发表学术论文0篇,在审学术论文3篇,授权发明专利1项。

  1. Lin Dajun, Li Songlin, Lü Shuai*, Zhou Wenbo*, Zhong Taihong, An Daolong. WCPC-TD3: Weighted contrastive policy constraint for offline reinforcement learning. 2025. (Submitted)
  2. Zhong Taihong, Lü Shuai*, Lin Dajun, An Daolong. Mild conservatism Q-learning with adaptive Q-ensemble. 2025. (Submitted)
  3. Long Zehong, Zhu Wenbo, Zhang Yushu, Lü Shuai*, Lin Dajun. Efficient exploration via state distribution discrepancy maximization in deep reinforcement learning. 2024. (Submitted)
  4. 吕帅, 龙泽泓, 钟太鸿, 林炟君. 一种基于SAC强化学习算法的智能运动控制方法. (专利号: ZL 2024 1 0726196.6, 授权公告日: 2024.08.13)

【学位论文】

  1. 林炟君. 基于策略约束的离线强化学习方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


张少杰,男,2000年04月生,安徽省合肥市人。

【学术论文】在国内外期刊和会议上发表学术论文2篇,在审学术论文1篇。

  1. Lü Shuai, Yuan Jianhui, Zhang Xinyu, Zhang Shaojie, Fang Wensi, Li Jingyao*. Pre-trained initialization and memory-enhanced correction for source-free universal domain adaptation. 2025. (Submitted)
  2. Zhang Zeyu, Shen Chun, Lü Shuai*, Zhang Shaojie. Reconfigurability-aware selection for contrastive active domain adaptation. In: Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), Jeju Island, South Korea, August 3-9, 2024, 5545-5553. (CCF推荐A类会议)
  3. Zhang Shaojie, Shen Chun, Lü Shuai*, Zhang Zeyu. Reviewing the forgotten classes for domain adaptation of black-box predictors. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024, 16830-16837. (CCF推荐A类会议)

【学位论文】

  1. 张少杰. 基于黑盒模型的无源领域自适应方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


钟太鸿,男,1999年11月生,辽宁省大连市人。

【学术论文和发明专利】在国内外期刊和会议上发表学术论文0篇,在审学术论文7篇,授权发明专利1项。

  1. Wu Hao, Li Songlin, Xiao Wei, Zhong Taihong, Lü Shuai*. Offline-to-online reinforcement learning with triple-intensity policy constraints. 2025. (Submitted)
  2. Lin Dajun, Li Songlin, Lü Shuai*, Zhou Wenbo*, Zhong Taihong, An Daolong. WCPC-TD3: Weighted contrastive policy constraint for offline reinforcement learning. 2025. (Submitted)
  3. Zhong Taihong, Lü Shuai*, Lin Dajun, An Daolong. Mild conservatism Q-learning with adaptive Q-ensemble. 2025. (Submitted)
  4. Zhou Ruikai, Zhong Taihong, Li Songlin, Lü Shuai*. A Kullback-Leibler divergence perspective on policy gradient methods in reinforcement learning. 2025. (Submitted)
  5. Zhou Ruikai, Zhong Taihong, Zhu Wenbo, Han Shuai, Lü Shuai*. Influence of Gaussian distribution on performance metrics in continuous reinforcement learning: A case study. 2025. (Submitted)
  6. Zhong Taihong, Han Shuai, Zhang Yushu, Long Zehong, Lü Shuai*, Wu Junhong. TATRC: Triple actor-critic structure with regularization for better performance. 2025. (Submitted)
  7. Long Zehong, Zhu Wenbo, Lü Shuai*, Wu Junhong, Zhong Taihong. Breaking the sample efficiency barrier by rethinking experience replay. 2025. (Submitted)
  8. 吕帅, 龙泽泓, 钟太鸿, 林炟君. 一种基于SAC强化学习算法的智能运动控制方法. (专利号: ZL 2024 1 0726196.6, 授权公告日: 2024.08.13)

【学位论文】

  1. 钟太鸿. 基于分布偏移的深度强化学习方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


吴珺泓,男,2000年09月生,山东省莱西市人。

【学术论文】在国内外期刊和会议上发表学术论文2篇,在审学术论文6篇。

  1. Wu Junhong, Liu Jie, Lü Shuai*. Alternate data augmentation for generalization in reinforcement learning. 2025. (Submitted)
  2. Wu Junhong, Liu Jie, Xiong Xi, An Daolong, Lü Shuai*. Focus on primary: Differential diverse data augmentation for generalization in visual reinforcement learning. 2025. (Submitted)
  3. Zhang Yushu, Shen Chun, An Daolong, Wu Junhong, Lü Shuai*. Reinforcement learning with extreme minimum distribution. 2025. (Submitted)
  4. Zhu Wenbo, Lü Shuai*, Long Zehong, Wu Junhong. Feature distillation for exploration in reinforcement learning. 2023. (Submitted)
  5. Zhong Taihong, Han Shuai, Zhang Yushu, Long Zehong, Lü Shuai*, Wu Junhong. TATRC: Triple actor-critic structure with regularization for better performance. 2025. (Submitted)
  6. Long Zehong, Zhu Wenbo, Lü Shuai*, Wu Junhong, Zhong Taihong. Breaking the sample efficiency barrier by rethinking experience replay. 2025. (Submitted)
  7. Xiong Xi, Shen Chun, Wu Junhong, Lü Shuai*, Zhang Xiaodan. Combined data augmentation framework for generalizing deep reinforcement learning from pixels. Expert Systems with Applications, 2025, 264: 125810. (中科院1区TOP期刊, CCF推荐C类期刊, SCI, 目前IF: 7.5)
  8. Zhu Sheng, Shen Chun, Lü Shuai*, Wu Junhong, An Daolong. Double buffers CEM-TD3: More efficient evolution and richer exploration. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024, 17193-17201. (CCF推荐A类会议)

【学位论文】

  1. 吴珺泓. 基于数据增强的可泛化视觉强化学习研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


安道龙,男,1998年10月生,河南省濮阳市人。

【学术论文】在国内外期刊和会议上发表学术论文1篇,在审学术论文8篇。

  1. Xiao Wei, Li Songlin, An Daolong, Wu Hao, Zhang Xiaodan, Lü Shuai*. Corrected critic and adaptive constraint for offline-to-online reinforcement learning. 2025. (Submitted)
  2. Li Songlin, Xiao Wei, Wu Hao, Zhang Xiaodan, An Daolong, Lü Shuai*. State proficiency-based adaptive fine-tuning for offline-to-online reinforcement learning. 2025. (Submitted)
  3. An Daolong, Shen Chun, Li Songlin, Xiao Wei, Lü Shuai*, Zhou Wenbo*. Result constraint behavior clone for offline reinforcement learning. 2025. (Submitted)
  4. Lin Dajun, Li Songlin, Lü Shuai*, Zhou Wenbo*, Zhong Taihong, An Daolong. WCPC-TD3: Weighted contrastive policy constraint for offline reinforcement learning. 2025. (Submitted)
  5. Zhong Taihong, Lü Shuai*, Lin Dajun, An Daolong. Mild conservatism Q-learning with adaptive Q-ensemble. 2025. (Submitted)
  6. Wu Junhong, Liu Jie, Xiong Xi, An Daolong, Lü Shuai*. Focus on primary: Differential diverse data augmentation for generalization in visual reinforcement learning. 2025. (Submitted)
  7. Zhang Yushu, Shen Chun, An Daolong, Wu Junhong, Lü Shuai*. Reinforcement learning with extreme minimum distribution. 2025. (Submitted)
  8. Shu Man, Lü Shuai*, Gong Xiaoyu, An Daolong, Li Songlin. Episodic memory-double actor-critic twin delayed deep deterministic policy gradient. 2024. (Submitted)
  9. Zhu Sheng, Shen Chun, Lü Shuai*, Wu Junhong, An Daolong. Double buffers CEM-TD3: More efficient evolution and richer exploration. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024, 17193-17201. (CCF推荐A类会议)

【学位论文】

  1. 安道龙. 基于结果约束的离线强化学习方法研究[硕士学位论文]. 长春: 吉林大学, 2025.

【荣誉奖励】

【联系方式】


李松霖,男,2000年12月生,吉林省长春市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文8篇。

  1. Xiao Wei, Li Songlin, An Daolong, Wu Hao, Zhang Xiaodan, Lü Shuai*. Corrected critic and adaptive constraint for offline-to-online reinforcement learning. 2025. (Submitted)
  2. Li Songlin, Xiao Wei, Wu Hao, Zhang Xiaodan, An Daolong, Lü Shuai*. State proficiency-based adaptive fine-tuning for offline-to-online reinforcement learning. 2025. (Submitted)
  3. Wu Hao, Li Songlin, Xiao Wei, Zhong Taihong, Lü Shuai*. Offline-to-online reinforcement learning with triple-intensity policy constraints. 2025. (Submitted)
  4. An Daolong, Shen Chun, Li Songlin, Xiao Wei, Lü Shuai*, Zhou Wenbo*. Result constraint behavior clone for offline reinforcement learning. 2025. (Submitted)
  5. Lin Dajun, Li Songlin, Lü Shuai*, Zhou Wenbo*, Zhong Taihong, An Daolong. WCPC-TD3: Weighted contrastive policy constraint for offline reinforcement learning. 2025. (Submitted)
  6. Zhou Ruikai, Li Songlin, Lü Shuai*. From simple to complex: Mitigating the impact of critic accuracy fluctuations by multi-agent. 2025. (Submitted)
  7. Zhou Ruikai, Zhong Taihong, Li Songlin, Lü Shuai*. A Kullback-Leibler divergence perspective on policy gradient methods in reinforcement learning. 2025. (Submitted)
  8. Shu Man, Lü Shuai*, Gong Xiaoyu, An Daolong, Li Songlin. Episodic memory-double actor-critic twin delayed deep deterministic policy gradient. 2024. (Submitted)

【荣誉奖励】

【联系方式】


袁健会,男,1999年06月生,吉林省长春市人。

【学术论文】在国内外期刊和会议上发表学术论文1篇,在审学术论文1篇。

  1. Lü Shuai, Yuan Jianhui, Zhang Xinyu, Zhang Shaojie, Fang Wensi, Li Jingyao*. Pre-trained initialization and memory-enhanced correction for source-free universal domain adaptation. 2025. (Submitted)
  2. Li Zhuang, Yuan Jianhui, Li Guixiang, Wang Hao, Li Xingcan, Li Dan, Wang Xinhua*. RSI-YOLO: Object detection method for remote sensing images based on improved YOLO. Sensors, 2023, 23: 6414. (中科院2区期刊, SCI, IF: 3.4)

【荣誉奖励】

【联系方式】


肖威,男,2001年11月生,山东省菏泽市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文5篇。

  1. Xiao Wei, Li Songlin, An Daolong, Wu Hao, Zhang Xiaodan, Lü Shuai*. Corrected critic and adaptive constraint for offline-to-online reinforcement learning. 2025. (Submitted)
  2. Li Songlin, Xiao Wei, Wu Hao, Zhang Xiaodan, An Daolong, Lü Shuai*. State proficiency-based adaptive fine-tuning for offline-to-online reinforcement learning. 2025. (Submitted)
  3. Wu Hao, Li Songlin, Xiao Wei, Zhong Taihong, Lü Shuai*. Offline-to-online reinforcement learning with triple-intensity policy constraints. 2025. (Submitted)
  4. An Daolong, Shen Chun, Li Songlin, Xiao Wei, Lü Shuai*, Zhou Wenbo*. Result constraint behavior clone for offline reinforcement learning. 2025. (Submitted)
  5. Zhu Wenbo, Xiao Wei, Lü Shuai*. Soft-penalty guided exploration in reinforcement learning. 2025. (Submitted)

【荣誉奖励】

【联系方式】


李贵祥,男,2003年04月生,山东省聊城市人。

【学术论文】在国内外期刊和会议上发表学术论文1篇,在审学术论文1篇。

  1. Li Zhuang, Li Guixiang, Song Xiangyang, Wang Xinhua*. An efficient and dynamic framework for multi-scale target detection of underwater organisms: EVD-YOLO. 2024. (Submitted)
  2. Li Zhuang, Yuan Jianhui, Li Guixiang, Wang Hao, Li Xingcan, Li Dan, Wang Xinhua*. RSI-YOLO: Object detection method for remote sensing images based on improved YOLO. Sensors, 2023, 23: 6414. (中科院2区期刊, SCI, IF: 3.4)

【荣誉奖励】

【联系方式】


吴昊,男,2002年02月生,内蒙古自治区额尔古纳市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文4篇。

  1. Xiao Wei, Li Songlin, An Daolong, Wu Hao, Zhang Xiaodan, Lü Shuai*. Corrected critic and adaptive constraint for offline-to-online reinforcement learning. 2025. (Submitted)
  2. Li Songlin, Xiao Wei, Wu Hao, Zhang Xiaodan, An Daolong, Lü Shuai*. State proficiency-based adaptive fine-tuning for offline-to-online reinforcement learning. 2025. (Submitted)
  3. Wu Hao, Li Songlin, Xiao Wei, Zhong Taihong, Lü Shuai*. Offline-to-online reinforcement learning with triple-intensity policy constraints. 2025. (Submitted)
  4. Zhu Sheng, Wu Hao, Shen Chun, Zhu Wenbo, Han Shuai, Lü Shuai*. Actor-critic of multi-agent collaboration on single-agent task. 2025. (Submitted)

【荣誉奖励】

【联系方式】


孙耕浩,男,2001年07月生,山东省德州市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文1篇。

  1. Chen Huangyang, Chen Juan, Zhang Tao, Sun Genghao, Lü Shuai*. Reward remodeling based on trajectory return from offline dataset in offline reinforcement learning. 2025. (Submitted)

【荣誉奖励】

【联系方式】


章晓丹,女,2002年01月生,山东省威海市人。

【学术论文】在国内外期刊和会议上发表学术论文1篇,在审学术论文2篇。

  1. Xiao Wei, Li Songlin, An Daolong, Wu Hao, Zhang Xiaodan, Lü Shuai*. Corrected critic and adaptive constraint for offline-to-online reinforcement learning. 2025. (Submitted)
  2. Li Songlin, Xiao Wei, Wu Hao, Zhang Xiaodan, An Daolong, Lü Shuai*. State proficiency-based adaptive fine-tuning for offline-to-online reinforcement learning. 2025. (Submitted)
  3. Xiong Xi, Shen Chun, Wu Junhong, Lü Shuai*, Zhang Xiaodan. Combined data augmentation framework for generalizing deep reinforcement learning from pixels. Expert Systems with Applications, 2025, 264: 125810. (中科院1区TOP期刊, CCF推荐C类期刊, SCI, 目前IF: 7.5)

【荣誉奖励】

【联系方式】


陈黄洋,男,2002年08月生,福建省漳州市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文1篇。

  1. Chen Huangyang, Chen Juan, Zhang Tao, Sun Genghao, Lü Shuai*. Reward remodeling based on trajectory return from offline dataset in offline reinforcement learning. 2025. (Submitted)

【荣誉奖励】

【联系方式】


张涛,男,2002年10月生,河南省濮阳市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文1篇。

  1. Chen Huangyang, Chen Juan, Zhang Tao, Sun Genghao, Lü Shuai*. Reward remodeling based on trajectory return from offline dataset in offline reinforcement learning. 2025. (Submitted)

【荣誉奖励】

【联系方式】


檀磊,男,2000年11月生,安徽省安庆市人。

【学术论文和发明专利】在国内外期刊和会议上发表学术论文1篇,在审学术论文0篇,申请发明专利(目前实质审查)1项。

  1. 马慧敏*, 檀磊, 张京会, 张鹏飞, 宁孝梅, 刘海秋, 高彦伟. 基于深度学习的合成孔径成像系统共相误差检测研究综述. 量子电子学报, 2022, 39(6): 927-941. (第一作者为指导教师)
  2. 檀磊, 马慧敏, 王小申, 戴明宇, 代腾辉, 焦俊, 刘倩, 辜丽川. 基于多尺度生成对抗网络的大气湍流图像复原方法及系统. (申请号: CN2023 1 1725750.0, 申请日: 2023.12.14, 目前实质审查)

【荣誉奖励】

【联系方式】


侯志斌,男,1999年08月生,山东省菏泽市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文0篇。

【荣誉奖励】

【联系方式】


张顺浩,男,2002年03月生,山东省济南市人。

【学术论文】在国内外期刊和会议上发表学术论文0篇,在审学术论文0篇。

【荣誉奖励】

【联系方式】


巩锦程,男,2002年09月生,山东省淄博市人。

【荣誉奖励】

【联系方式】


甄德杰,男,2003年05月生,河北省邢台市人。

【荣誉奖励】

【联系方式】


钟金运,男,2003年06月生,江西省瑞金市人。

【荣誉奖励】

【联系方式】


常钰,女,2003年01月,辽宁省大连市人。

【荣誉奖励】

【联系方式】


姜文康,男,2003年07月,山东省德州市人。

【荣誉奖励】

【联系方式】