feat(data): parse 3-agency PDF transition matrices to CSV #task-290

- New: data/parse_pdf_matrices.py (KR/NICE/SCI PDF parser)
  - KR: text-based parser (space-separated numbers + dashes)
  - NICE: text-based parser (clean numeric format)
  - SCI: pdfplumber table extraction (column-position-aware)
  - WR redistribution, B이하→B mapping, CCC extrapolation from PD patterns
- Modified: data/transition_matrices.py (added source='real' loader)
- Modified: config.yaml (data.transition_source: 'real')
- Modified: main.py (reads transition source from config)
- Output: 112 CSV files (KR/NICE/SCI/AVG × 28 years)
This commit is contained in:
Variet Agent
2026-03-11 01:07:27 +09:00
parent ebdc6b805b
commit 8af743e6f3
116 changed files with 1714 additions and 3 deletions

9
data/real/KR_2025.csv Normal file
View File

@@ -0,0 +1,9 @@
,AAA,AA,A,BBB,BB,B,CCC,D
AAA,0.995674,0.004326,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000
AA,0.008638,0.970505,0.020436,0.000421,0.000000,0.000000,0.000000,0.000000
A,0.000000,0.046355,0.922115,0.027547,0.001328,0.002213,0.000000,0.000443
BBB,0.000000,0.000000,0.067727,0.879654,0.031095,0.014401,0.003000,0.004122
BB,0.000000,0.000000,0.002883,0.055046,0.796592,0.087759,0.009751,0.047969
B,0.000000,0.000000,0.000000,0.004327,0.012835,0.722757,0.127545,0.132535
CCC,0.000000,0.000000,0.001000,0.002000,0.003000,0.010268,0.617541,0.366190
D,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,1.000000
1 AAA AA A BBB BB B CCC D
2 AAA 0.995674 0.004326 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000
3 AA 0.008638 0.970505 0.020436 0.000421 0.000000 0.000000 0.000000 0.000000
4 A 0.000000 0.046355 0.922115 0.027547 0.001328 0.002213 0.000000 0.000443
5 BBB 0.000000 0.000000 0.067727 0.879654 0.031095 0.014401 0.003000 0.004122
6 BB 0.000000 0.000000 0.002883 0.055046 0.796592 0.087759 0.009751 0.047969
7 B 0.000000 0.000000 0.000000 0.004327 0.012835 0.722757 0.127545 0.132535
8 CCC 0.000000 0.000000 0.001000 0.002000 0.003000 0.010268 0.617541 0.366190
9 D 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 1.000000