P19E99
Home
Archive
About
GitHub
Home
Archive
About
GitHub
P19E99
Former competitive programmer, researching Adversarial Examples in AI Security
Categories
abc
1
dl
1
hot100
2
Interview Notes
13
leetcode
5
read paper
1
sec
1
test
1
More
Tags
abc
dl
hot100
Interview Notes
leetcode
read paper
sec
test
字节跳动推荐算法论文 HyFormer 精读(零基础友好)
2026-03-22
read paper
/
read paper
HyFormer: Revisiting the Roles of Sequence Modeling and Feature Interaction in CTR Prediction
3515 words
|
18 minutes
力扣 hot100 - 双指针
2026-03-12
hot100
/
hot100
hot100
868 words
|
4 minutes
RLHF(三):从 GRPO 到 DAPO
2026-03-11
Interview Notes
/
Interview Notes
GRPO and DAPO
1670 words
|
8 minutes
RLHF(二):PPO 代码以及细节补充
2026-02-28
Interview Notes
/
Interview Notes
PPO 代码
3451 words
|
17 minutes
力扣每日一题 20260226
2026-02-26
leetcode
/
leetcode
力扣每日一题 20260226
215 words
|
1 minute
RLHF(一):PPO 基本流程
2026-02-26
Interview Notes
/
Interview Notes
PPO 基本流程
1329 words
|
7 minutes
力扣每日一题 20260225
2026-02-25
leetcode
/
leetcode
力扣每日一题 20260225
75 words
|
1 minute
力扣每日一题 20260224
2026-02-24
leetcode
/
leetcode
力扣每日一题 20260224
281 words
|
1 minute
1
2
3
4