•  


Stack Overflow API solutions - Stack Overflow

Improve the performance of AI models & products.

OverflowAPI

OverflowAPI is a subscription-based API service that provides continuous access to Stack Overflow’s public dataset to train and fine-tune large language models.

Become a partner

Join us in creating a new era of socially responsible AI.

We believe AI models and products must provide proper attribution and contribute value back to the communities creating and curating the data that fuels them. Learn more about our definition of socially responsible AI and the commitments we require from our partners.

Read the blog
Google Cloud

Stack Overflow and Google Cloud partner to bring Generative AI to millions of developers through the Stack Overflow platform, Google Cloud Console, and Duet AI.

Read announcement

Access high-quality technical content for commercial use cases.

Only 42% of developers trust the accuracy of AI tools. 1 Improve accuracy, product differentiation, and personalization with Stack Overflow’s dataset.

58M+ human-generated questions and answers with feedback signals from users and moderators.
Top-class technical expertise and experience, expressed with natural language, is ideal for LLM training.
Includes diverse tasks related to coding, advising, debugging, explaining, testing, reviewing, brainstorming, and troubleshooting.
Continuous access to newly created, up-to-date technical knowledge.

Improve model performance with specialized and precise data.

Based on internal & independent tests, fine-tuning on Stack Overflow data results in substantial model performance improvements.

Figure 1. Percent of “Perfect” answers (internal testing)

Based on a proprietary eval set of 1000 Q&A with ground truth answers created from Stack Exchange and Prosus AI Assistant technical Q&A (with highest user rating).

14.13%
Instruction fine tuned
MPT 30B
31.52%
Stack Overflow trained fine tuned
MPT 30B
37.38%
Code fine tuned
Code Llama-2 34B Instruction fine tuned
55.30%
Stack Overflow fine tuned
Code Llama-2 34B
Pre Stack Overflow training / fine tuning
Post Stack Overflow training / fine tuning

Figure 2. ‘InCoder’ model

InCoder found Stack Overflow data improved the HumanEval benchmark and MBPP (Mostly Basic Python Programming) performance.
Baseline
With Stack Overflow data
HumanEval pass@1
5
9
MBPP pass@1
6.1
9.8
- "漢字路" 한글한자자동변환 서비스는 교육부 고전문헌국역지원사업의 지원으로 구축되었습니다.
- "漢字路" 한글한자자동변환 서비스는 전통문화연구회 "울산대학교한국어처리연구실 옥철영(IT융합전공)교수팀"에서 개발한 한글한자자동변환기를 바탕하여 지속적으로 공동 연구 개발하고 있는 서비스입니다.
- 현재 고유명사(인명, 지명등)을 비롯한 여러 변환오류가 있으며 이를 해결하고자 많은 연구 개발을 진행하고자 하고 있습니다. 이를 인지하시고 다른 곳에서 인용시 한자 변환 결과를 한번 더 검토하시고 사용해 주시기 바랍니다.
- 변환오류 및 건의,문의사항은 juntong@juntong.or.kr로 메일로 보내주시면 감사하겠습니다. .
Copyright ⓒ 2020 By '전통문화연구회(傳統文化硏究會)' All Rights reserved.
 한국   대만   중국   일본