[DT] Model to System: The Age of Inference, Where Are We Going?

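Guest lecture (8) for the course Special Topics in Intelligent Industrial Convergence Services (지능형산업융합서비스특론): Lee Bongho (이봉호), Woowa Brothers (우아한형제들)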

Posted May 9, 2025 Updated Jun 12, 2025

By Cheong seolmo


Code

The Age of Inference is Coming
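- Paper ‘BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs’: replaces floating-point values with roughly 1-bit representations, which speeds up inference with reportedly little to no loss in quality
- Paper ‘70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float’: losslessly compresses the model by storing floating-point values in a dynamic-length format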
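
To make the low-bit idea concrete, here is a minimal sketch of absmean ternary quantization, the weight scheme described in the earlier BitNet b1.58 work. It is an illustration only and not BitNet v2's implementation, which additionally applies a Hadamard transformation so that activations can be quantized to 4 bits. The function names `quantize_ternary` and `dequantize` and the sample weights are hypothetical.

```python
def quantize_ternary(weights):
    """Map float weights to {-1, 0, +1} plus one shared float scale.

    Assumption: absmean scaling (scale = mean of |w|), followed by
    round-and-clip to the range [-1, 1].
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        # All-zero input: nothing to scale.
        return [0] * len(weights), 0.0
    q = [max(-1, min(1, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float weights from ternary codes."""
    return [v * scale for v in q]


# Hypothetical sample weights, for illustration only.
weights = [0.9, -0.05, -1.2, 0.4, 0.0, -0.6]
q, scale = quantize_ternary(weights)
approx = dequantize(q, scale)
print(q)       # ternary codes
print(approx)  # coarse reconstruction of the original weights
```

Each weight now needs under 2 bits of storage plus one shared scale, and multiplying by -1, 0, or +1 reduces to additions and subtractions, which is where the speedup is expected to come from.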
Data Centric AI
- Paper ‘s1: Simple test-time scaling’: the era of self-multiplying data
- Paper ‘Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies’: autonomous driving as a field where data is what matters
Proprietary Data / Sensitive Data
Non-Verifiable Data
- Any task in our lives that can be judged on a 0-or-1 scale (Verifiable) can be taken over by AI
- We should therefore aim to work on what is Non-Verifiable
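
One way to read "Verifiable" here is that a program can score any candidate answer as 0 or 1. The sketch below is a hypothetical example of such a checker for a sorting task; the name `verify_sorted` is an assumption for illustration and does not come from the lecture.

```python
def verify_sorted(candidate, original):
    """Return 1 if candidate is the correctly sorted version of original, else 0."""
    return int(candidate == sorted(original))


# A verifiable task: every answer gets an unambiguous 0 or 1.
print(verify_sorted([1, 2, 3], [3, 1, 2]))
print(verify_sorted([1, 3, 2], [3, 1, 2]))
```

A Non-Verifiable task, in this reading, is one where no such checker can be written, for example judging whether a piece of writing is moving.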

This post is licensed under CC BY 4.0 by the author.