•  


Question about hardware requirements · Issue #102 · getumbrel/llama-gpt · GitHub
Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement . We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about hardware requirements #102

Open
cdmoss opened this issue Sep 11, 2023 · 3 comments
Open

Question about hardware requirements #102

cdmoss opened this issue Sep 11, 2023 · 3 comments

Comments

@cdmoss
Copy link

Hello - excellent project, I'm super excited about any option for decentralized AI.

I'm extremely green to this domain - I was hoping someone could help me understand GPU requirements for deploying this. I see many other in the issues boasting 4090s, dual 3080s, etc. - I'm wondering if it's viable for me to try using the 70b with my paltry 6700xt and ryzen 3700x (or what the max model to use with no GPU at all if any, and what the state of AMD support is). I recognize all this info is out there - I'd greatly appreciate links to resources for my own research.

@stratus-ss
Copy link

I can't speak to the state of support with AMD is, but currently for GPU usage, CUDA is required. Largely I believe the larger number of cuda cores has a pretty large impact on performance (as does the amount of ram in each card).

As for the 70b, I am running that as a test on a i5-9400k with 64G of ram and no GPU support. Its fairly slow, especially compared to running the 13b model. I am running a 13b model with decent performance in a vm on top of a ryzen 5 3600x, so from that perspective you should be good.

@hoelee
Copy link

I run 34b model with 13900k CPU only, 32 threads 100% running still taking minutes in order to get full reply. I'm not sure if this speed is what I was expecting.

@istler
Copy link

Feels like 34b is out of reach for mere mortals.

Sign up for free to join this conversation on GitHub . Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants
- "漢字路" 한글한자자동변환 서비스는 교육부 고전문헌국역지원사업의 지원으로 구축되었습니다.
- "漢字路" 한글한자자동변환 서비스는 전통문화연구회 "울산대학교한국어처리연구실 옥철영(IT융합전공)교수팀"에서 개발한 한글한자자동변환기를 바탕하여 지속적으로 공동 연구 개발하고 있는 서비스입니다.
- 현재 고유명사(인명, 지명등)을 비롯한 여러 변환오류가 있으며 이를 해결하고자 많은 연구 개발을 진행하고자 하고 있습니다. 이를 인지하시고 다른 곳에서 인용시 한자 변환 결과를 한번 더 검토하시고 사용해 주시기 바랍니다.
- 변환오류 및 건의,문의사항은 juntong@juntong.or.kr로 메일로 보내주시면 감사하겠습니다. .
Copyright ⓒ 2020 By '전통문화연구회(傳統文化硏究會)' All Rights reserved.
 한국   대만   중국   일본