
Exp14 Step 3: Full Dataset + WINDOW=8 + 32×32

Changes from Step 2: data 45→150 episodes, WINDOW 3→8, image 16×16→32×32 grayscale.
Input dimension: 268 → 1056 floats. MLP: 512→256→128→8.
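The dimension arithmetic above can be checked with a short sketch. It assumes the input is the flattened window of four per-frame features (cx, cy, area, has_bbox, as listed in the settings) concatenated with the flattened grayscale image, and that the MLP is a stack of plain fully connected layers with biases. The helper names are hypothetical and not taken from the experiment code.

```python
# Sketch: verify the input dimensions and count MLP parameters.
# Assumption: 4 features per frame (cx, cy, area, has_bbox) plus a flattened
# square grayscale image, concatenated into one float vector.

FEATURES_PER_FRAME = 4


def input_dim(window: int, image_side: int) -> int:
    """Length of the flattened input vector."""
    return window * FEATURES_PER_FRAME + image_side * image_side


def dense_param_count(layer_sizes: list[int]) -> int:
    """Weights plus biases for a stack of fully connected layers."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))


step2_dim = input_dim(window=3, image_side=16)
step3_dim = input_dim(window=8, image_side=32)

# Assumption: 512, 256, 128 are hidden widths and 8 is the output layer.
step3_layers = [step3_dim, 512, 256, 128, 8]

print("Step 2 input dim:", step2_dim)
print("Step 3 input dim:", step3_dim)
print("Step 3 MLP parameters:", dense_param_count(step3_layers))
```

Most of the parameters sit in the first layer, so the move from 16×16 to 32×32 images is the main driver of model size here.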

Step 2 (baseline): 75.9% (Step 2 reference, 45 ep, W=3)
Step 3 PM: 77.0% (404/525)
Data: 2626 windows (150 ep)

PM per Path Type (test split)

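| Path Type | Correct/Total | PM |
|---|---|---|
| center_straight | 47/56 | 83.9% |
| center_left | 33/54 | 61.1% |
| center_right | 25/54 | 46.3% |
| left_straight | 68/72 | 94.4% |
| left_left | 39/55 | 70.9% |
| left_right | 42/57 | 73.7% |
| right_straight | 71/72 | 98.6% |
| right_left | 40/57 | 70.2% |
| right_right | 39/48 | 81.2% |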
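The per-type figures in the table can be re-derived from the raw counts. The sketch below copies the Correct/Total pairs from the table, recomputes each PM percentage, sums the overall count, and groups rows by the second token of the path-type name. Treating that token as the turn direction is an assumption about the naming scheme, and the variable names are illustrative.

```python
# Correct/Total pairs copied from the per-path-type table.
COUNTS = {
    "center_straight": (47, 56),
    "center_left": (33, 54),
    "center_right": (25, 54),
    "left_straight": (68, 72),
    "left_left": (39, 55),
    "left_right": (42, 57),
    "right_straight": (71, 72),
    "right_left": (40, 57),
    "right_right": (39, 48),
}


def pm_percent(correct: int, total: int) -> float:
    """PM as a percentage of correct predictions."""
    return 100.0 * correct / total


# Per-type PM.
for name, (correct, total) in COUNTS.items():
    print(f"{name:<16} {correct:>3}/{total:<3} {pm_percent(correct, total):5.1f}%")

# Overall PM across all path types.
overall_correct = sum(c for c, _ in COUNTS.values())
overall_total = sum(t for _, t in COUNTS.values())
print(f"overall {overall_correct}/{overall_total} "
      f"{pm_percent(overall_correct, overall_total):.2f}%")

# Assumption: the token after "_" names the turn direction.
by_turn: dict[str, list[int]] = {}
for name, (correct, total) in COUNTS.items():
    turn = name.split("_")[1]
    agg = by_turn.setdefault(turn, [0, 0])
    agg[0] += correct
    agg[1] += total

for turn, (correct, total) in by_turn.items():
    print(f"{turn:<8} {correct}/{total} {pm_percent(correct, total):.1f}%")
```

Under that grouping, the straight paths score well above the left and right turns, which is where most of the remaining PM error sits.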

End-to-End Gate Check (Exp17 / Exp18)

| Model | PM | Closed-loop | FPE | Interpretation |
|---|---|---|---|---|
| Exp14 Step2 | 75.9% | 66.7% | 0.55m | Strongest practical baseline |
| Exp17 | 76.95% | 11.1% | 1.04m | Only PM improved; rollout failed |
| Exp18 | 27.62% | 11.1% | 1.04m | Text-fusion gate failed |
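The table shows why PM alone is not used as the acceptance signal. As a minimal sketch, the comparison can be written as a gate that checks closed-loop success and FPE against the baseline. The rule in `passes_gate` is a hypothetical criterion chosen for illustration; the source does not state the actual gate thresholds, and `RunResult` is an invented container.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunResult:
    model: str
    pm_pct: float           # PM on the test split, percent
    closed_loop_pct: float  # closed-loop success rate, percent
    fpe_m: float            # FPE in metres


# Values copied from the gate-check table.
BASELINE = RunResult("Exp14 Step2", 75.9, 66.7, 0.55)
CANDIDATES = [
    RunResult("Exp17", 76.95, 11.1, 1.04),
    RunResult("Exp18", 27.62, 11.1, 1.04),
]


def passes_gate(candidate: RunResult, baseline: RunResult) -> bool:
    """Hypothetical rule: no regression in closed-loop success or FPE."""
    return (candidate.closed_loop_pct >= baseline.closed_loop_pct
            and candidate.fpe_m <= baseline.fpe_m)


for run in CANDIDATES:
    pm_delta = run.pm_pct - BASELINE.pm_pct
    verdict = "pass" if passes_gate(run, BASELINE) else "fail"
    print(f"{run.model}: PM {pm_delta:+.2f} pts vs baseline, gate {verdict}")
```

Under this rule Exp17 fails despite a slightly higher PM than the baseline, which matches the interpretation column.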
Settings
Input: WINDOW=8 × (cx, cy, area, has_bbox) + 32×32 grayscale image
Output: 8-class discrete action | epochs=300 | AdamW lr=2e-3
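One way the input described above could be assembled is sketched below. The ordering (window features first, then image pixels in row-major order) and the function name `build_input` are assumptions; the source only specifies the components and the total size.

```python
WINDOW = 8
IMAGE_SIDE = 32
FEATURES_PER_FRAME = 4  # cx, cy, area, has_bbox


def build_input(frames, image):
    """Flatten a feature window and a grayscale image into one float list.

    frames: WINDOW tuples of (cx, cy, area, has_bbox), oldest first.
    image:  IMAGE_SIDE rows, each with IMAGE_SIDE pixel values.
    """
    if len(frames) != WINDOW:
        raise ValueError(f"expected {WINDOW} frames, got {len(frames)}")
    if len(image) != IMAGE_SIDE or any(len(row) != IMAGE_SIDE for row in image):
        raise ValueError(f"expected a {IMAGE_SIDE}x{IMAGE_SIDE} image")

    vec = []
    # Assumed layout: per-frame features first, in temporal order.
    for cx, cy, area, has_bbox in frames:
        vec.extend((float(cx), float(cy), float(area), float(has_bbox)))
    # Then the image, flattened row by row.
    for row in image:
        vec.extend(float(p) for p in row)
    return vec


# Placeholder inputs, only to exercise the shapes.
dummy_frames = [(0.5, 0.5, 0.1, 1)] * WINDOW
dummy_image = [[0.0] * IMAGE_SIDE for _ in range(IMAGE_SIDE)]
x = build_input(dummy_frames, dummy_image)
print(len(x))
```

The resulting length is WINDOW × 4 + 32 × 32, which is the 1056-float input the MLP consumes.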
Exp18 Failure Pattern Summary
The best checkpoint reached a val_loss of 1.325, yet actual PM was only 27.62% and closed-loop success only 11.1%.
In the PM confusion matrix, all 95 FORWARD samples were misclassified as FWD+R, and in closed-loop only center_right succeeded.
In short, text embedding fusion did not resolve the current end-to-end instability, and the strongest baseline remains Exp14 Step2.
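A collapse like the FORWARD to FWD+R pattern can be flagged automatically from prediction logs. The sketch below is one hedged way to do it: `collapsed_classes` is a hypothetical helper, and the only data used is the single fact reported above (95 FORWARD samples, all predicted as FWD+R); the remaining classes are omitted because their counts are not given.

```python
from collections import Counter


def confusion_counts(y_true, y_pred):
    """Count (true_label, predicted_label) pairs."""
    return Counter(zip(y_true, y_pred))


def collapsed_classes(y_true, y_pred):
    """Map each true class that is never predicted correctly and is always
    sent to one single other class onto that other class."""
    per_true = {}
    for (true_label, pred_label), n in confusion_counts(y_true, y_pred).items():
        per_true.setdefault(true_label, Counter())[pred_label] += n

    collapsed = {}
    for true_label, preds in per_true.items():
        if preds.get(true_label, 0) == 0 and len(preds) == 1:
            collapsed[true_label] = next(iter(preds))
    return collapsed


# From the summary: 95 FORWARD samples, all predicted as FWD+R.
y_true = ["FORWARD"] * 95
y_pred = ["FWD+R"] * 95

print(confusion_counts(y_true, y_pred))
print(collapsed_classes(y_true, y_pred))
```

Running this check next to val_loss at checkpoint selection would surface such a collapse even when the loss value looks acceptable.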