iconiconicon
kAIto47802
Fourth year undergraduate student at the University of Tokyo Department of Mathematical Engineering and Information Physics, Faculty of Engineering
GitHub

Publications

International Conference

The T05 System for the VoiceMOS Challenge 2024: Transfer Learning from Deep Image Classifier to Naturalness MOS Prediction of High-Quality Synthetic Speech
Kaito Baba, Wataru Nakata, Yuki Saito, and Hiroshi Saruwatari
IEEE Spoken Language Technology Workshop (SLT), 2024
GitHubarXivPoster

Workshop

JRadiEvo: A Japanese Radiology Report Generation Model Enhanced by Evolutionary Optimization of Model Merging
Kaito Baba, Ryota Yagi, Junichiro Takahashi, Risa Kishikawa, Satoshi Kodera
AIM-FM Workshop @ NeurIPS, 2024
arXiv
Application of Contrastive Learning on ECG Data: Evaluating Performance in Japanese and Classification with Around 100 Labels
Junichiro Takahashi, Yasukawa Kan, Kaito Baba, Satoshi Kodera
AIM-FM Workshop @ NeurIPS, 2024

Domestic Conference

UTMOSv2: 自然性MOS予測におけるスペクトログラム特徴量とSSL特徴量の統合的利用
馬場 凱渡,中田 亘,齋藤 佑樹,猿渡 洋
日本音響学会第152回 (2024年秋季) 研究発表会, 2024
GitHub

Awards and Honors

~08/2024
4 bronze medals🥉 in Kaggle competitions
~08/2024
Won 4 solo bronze medals so far in Kaggle, the world's largest machine learning competition.
06/2024
1st place🥇 in 7 out of 16 metrics and 2nd place🥈 in the remaining 9 metrics in the VoiceMOS Challenge 2024 Track1
06/2024
Took the above ranking in a global competition to create a machine learning model to predict the naturalness MOS of synthetic speech.
03/2024
1st Place, Hackathon by AIFUL (aihack2024)🥇
03/2024
Developed a machine learning model for predicting the credit score of individuals.
10/2023
Special Prize, 3rd Place, Hackathon by LIFECARD🥉
10/2023
Created a machine learning model for credit scoring.
09/2023
World Rank 3, CodinGame Othello AI competition🥉
09/2023
Developed an AI for the game of Othello by combining deep reinforcement learning and heuristic search algorithms.
03/2023
Special Prize, 4th Place in the Final Round, Hackathon by SoftBank🏅
03/2023
Developed a web application integrated with object detection AI.
03/2023
Outstanding Achievement Award, Deep Reinforcement Learning Spring Seminar, Matsuo Laboratory🏅
03/2023

Work Experience

07/2024
- Present
Preferred Networks, Inc.
Part-time Engineer
07/2024 - Present
Developing Optuna, an hyperparameter optimization library, as part of the AutoML team.
05/2024
- Present
The University of Tokyo Hospital
Technical Staff (Part-time)
05/2024 - Present
Conducting research and development of medical AI in collaboration with physicians.
2023/12
- 2024/05
SIGNATE Inc.
Technical Advisor, Educational Material Creator (Outsourcing)
2023/12 - 2024/05
Created materials on RAG (Retrieval-Augmented Generation) and prompt engineering as an LLM Advisor.
Created educational materials on data analysis, LLM usage, algorithms, optimization, etc.
2023/11
- 2024/04
kuzen Inc.
Full-stack Engineer (Internship)
2023/11 - 2024/04
Involved in front-end and back-end implementation, as well as the development of a RAG system using LangChain.
2023/10
- 2023/12
Recruit Co., Ltd
Datascientist (Internship)
2023/10 - 2023/12
Worked in the Search Group of the Data Promotion Office, developing a RAG system to improve the internal document search experience.
07/2023
- Present
The University of Tokyo Edge Capital Partners Co., Ltd. (UTEC)
Research Assistant
07/2023 - Present
Conducting data collection, analysis, and predictive model creation for investors.
Developing a system for an internal tool.
2023/04
- 2023/09
LearnWiz Inc.
Font-end Engineer (Outsourcing)
2023/04 - 2023/09
Carried out front-end implementation and proposed UI/UX design for web applications.
2023/01
- 2023/09
GHELIA Inc.
AI Researcher (Internship)
2023/01 - 2023/09
Conducted research and development in deep learning related to image recognition.

Technical Skills

Deal with:
Deep learning, reinforcement learning, front-end development, back-end development, iOS app development, etc.
Languages and Frameworks:
Advanced:
Rust, Python, PyTorch, TensorFlow, TypeScript, JavaScript, React, Next.js, HTML/CSS, C, C++, bash, Swift, Go, Google Apps Script, and R
Intermediate:
Haskell, OCaml, Scheme, Assembly (RISC-V), SQL, Fortran, Verilog, C# (Unity), Julia, Java, and Kotlin
Others:
Git/GitHub, Docker, AWS, GCP, Linux, GitHub Actions, GraphQL, Arduino, Raspberry Pi, etc.