Henok | Neural Nets

رفتن به کانال در Telegram

Group: https://t.me/neural_netss_chat

نمایش بیشتر

أثيوبيا8 766 فناوری و برنامه‌ها28 997

2 248

مشترکین

-324 ساعت

-87 روز

-3130 روز

922

نمایش های پست

اطلاعاتی وجود ندارد24 ساعت

اطلاعاتی وجود ندارد48 ساعت

41.01%

نرخ مشارکت

اطلاعاتی وجود ندارد

پست های در روز

Ads index

beta

در حال بارگیری داده...

کانال‌های مشابه

کانال‌های بیشتر

ابر برچسب‌ها

اشارات ورودی و خروجی

---

جذب مشترکین

ژوئیه '26

+11

در 1 کانال‌ها

ژوئن '26

+48

در 4 کانال‌ها

Get PRO

مه '26

+91

در 8 کانال‌ها

Get PRO

آوریل '26

+84

در 3 کانال‌ها

Get PRO

مارس '26

+69

در 0 کانال‌ها

Get PRO

فوریه '26

+65

در 5 کانال‌ها

Get PRO

ژانویه '26

+147

در 16 کانال‌ها

Get PRO

دسامبر '25

+169

در 19 کانال‌ها

Get PRO

نوامبر '25

+104

در 8 کانال‌ها

Get PRO

اکتبر '25

+239

در 19 کانال‌ها

Get PRO

سپتامبر '25

+45

در 0 کانال‌ها

Get PRO

اوت '25

+85

در 4 کانال‌ها

Get PRO

ژوئیه '25

+78

در 2 کانال‌ها

Get PRO

ژوئن '25

+66

در 0 کانال‌ها

Get PRO

مه '25

+128

در 5 کانال‌ها

Get PRO

آوریل '25

+94

در 3 کانال‌ها

Get PRO

مارس '25

+87

در 4 کانال‌ها

Get PRO

فوریه '25

+111

در 5 کانال‌ها

Get PRO

ژانویه '25

+159

در 9 کانال‌ها

Get PRO

دسامبر '24

+177

در 7 کانال‌ها

Get PRO

نوامبر '24

+167

در 10 کانال‌ها

Get PRO

اکتبر '24

+161

در 10 کانال‌ها

Get PRO

سپتامبر '24

+241

در 13 کانال‌ها

Get PRO

اوت '24

+421

در 7 کانال‌ها

تاریخ	رشد مشترکین	اشارات	کانال‌ها
30 ژوئیه	+1
29 ژوئیه	+1
28 ژوئیه	+2
27 ژوئیه	+1
26 ژوئیه	+1
25 ژوئیه	0
24 ژوئیه	0
23 ژوئیه	0
22 ژوئیه	0
21 ژوئیه	0
20 ژوئیه	0
19 ژوئیه	+1
18 ژوئیه	0
17 ژوئیه	0
16 ژوئیه	0
15 ژوئیه	0
14 ژوئیه	0
13 ژوئیه	0
12 ژوئیه	0
11 ژوئیه	0
10 ژوئیه	0
09 ژوئیه	0
08 ژوئیه	+1
07 ژوئیه	+2
06 ژوئیه	0
05 ژوئیه	0
04 ژوئیه	0
03 ژوئیه	0
02 ژوئیه	+1
01 ژوئیه	0

پست‌های کانال

Repost from Birhan Nega

የእግርኳስ ሆርሙዝ ሰርጥ 🔥

2	Introducing Papers API • scholarxiv.com/developers We hosted 3,032,697+ (3M+) papers so you don't have to explore research at the rate of one query per 3 seconds (arXiv's API limits) — instead you can explore research at 3,600 queries per hour. That's one query every single second, everyday! With that kind of rate you can imagine what kind of research agents and products you can build! Filter by title, author, category, abstract, or date. Get clean, structured metadata containing titles, authors, abstracts, categories, PDF links and more without parsing XML or rate-limiting yourself. Alongside the Papers API we're also launching our developers platform. This's where you can manage your API keys, track usage, try the playground and explore the documentation. There's a lot more we're building and we can't wait to see what you're going to build with this. #Launch #PapersAPI #DevelopersPlatform @ScholarXIV	706
3	Today in ባህር ዳር	1 037
4	Diogenes the Cynic's story is interesting. If I was a member of the parliament I'd have entered with lit lantern😂 https://penelope.uchicago.edu/encyclopaedia_romana/greece/hetairai/diogenes.html	896
5	DataSpires_JD_CTO_Technical_Lead-3.pdf	1
6	DataSpires_JD_CTO_Technical_Lead-3.pdf	1
7	+1 so for world cup, I predicted 2/2 correct predictions 😎, I mean who is stopping me know. I dare you to ask me who the next PM is going to be	767
8	It's actually very fast, but not sure how much the throughput vs quality trade off is	658
9	Diffusion Gemma is really nice. Making me go back to abandoned diffusion based text generation projects https://deepmind.google/models/gemma/diffusiongemma/	639
10	New 4B translation model from Hasab for few Ethiopian langs. It's good to see almost everyone who is working on AI/ML in Ethiopia is releasing something. This will help a lot to make a progress. Now tag Ethiopian AI Institute to release something too 😁 https://huggingface.co/hasab-ai/YehaTranslate	951
11	I'll share the results of my exploration this weekend. Hopefully a long report	753
12	My weekly GPU usage📊 ~1267.2 kWh According to Gemini this can power an average household refrigerator for 14 to 16 months, or driving a standard electric vehicle (EV) for about 6,000 kms The good thing is the cluster runs on renewable energy so close to zero carbon footprint and recycles the heat back.	740
13	+1 Oh wow	2 109
14	Saw a post about ScholarXIV from Babi, so you heard about fake citations right, well ScholarXIV can help with that🔥 But open research problem could be how can you ground the LLMs so they can cite exactly from the paper without altering results, no errors or attach the citation to the wrong sentence etc Maybe methods like on-policy distillation could help here, let a verifier identify where the model’s claim diverges from the source, then train the model to down weight those unsupported claims, wrong source sentence citations etc.	2 194
15	So I got stuck making an objective function with an anti collapse loss so the embedding space doesn’t collapse into a useless low dimensional subspace 😞 maybe venting here helps lol, tips are also welcomed	735
16	But we had a retweet from him😁, not as good as Arsenal's win but a win is a win	806
17	I've been a fan of Sasha for quite a while. I even tried emailing him and applied to be advised by him in 2022 when he was at Cornell Tech. I didn't get a reply on that email 😂. He also got great students, he now works at Cursor, if you follow this channel I also post some of the puzzle he made like the GPU puzzle etc. This was a post from Dwarksh yesterday. Don't be shy just email any researcher, start today, who cares. You either get better mentors, or watch that person get famous and flex saying I've emailed that guy before 😂 Recently met @srush_nlp and he started giving me an impromptu lecture on how targeted on-policy self-distillation works. I asked him if I could record it on my iPhone. The basic idea is this: if the model made a mistake at some point in the rollout (for example, calling a tool that doesn't exist), we want to discourage this specific error, but we don't want to just learn from the final reward, because it's a very noisy signal spread out over the whole trajectory. So we have another model read this trajectory and figure where the error was made. It simply inserts some hint tokens to the part of the trajectory right above where the mistake was made. Now with these injected hint tokens, have the model run a forward pass. You're not having to regenerate a new rollout - aka no new decode required. The hint causes the model to assign lower probabilities to the error tokens. You then trains the original model to match these new probabilities, teaching it to downweight that specific mistake. https://x.com/dwarkesh_sp/status/2062353335529935114?s=20	701
18	+4 Gheero(formerly iCog) is opening applications for its Applied AI & Machine Learning Residency Program, an 8-week program for people who want to build real AI systems, not just study the concepts. Residents will work on practical AI and machine learning projects, learn how models are designed, trained, evaluated, and integrated into real products, and get mentorship from experienced builders. The program focuses on applied AI, engineering discipline, problem-solving, and technical growth. Strong participants may also be considered for internship or full-time roles after the residency. To apply, send your CV and relevant documents to recruitment@gheero.et with the subject line: Applied AI & Machine Learning Residency Program. Read more here	521
19	Let's talk about Metro So here they have the Montreal Metro, which started in 1966. It's so big and complex and serves over 1 Million people daily🤯. Now the crazy thing is, in today's estimate the cost to build it, is around $1.5 billion. This money is about 1/3 of the total cost of Renaissance Dam(I saw it took about $5 billion). So they spent this amount of money to make an infrastructure for just one city. Now I started thinking what a metro like this in Addis could solve. You could go from Alem Gena to Lege Tafo or from Saris to Gulele easily and fast. More over you will have a predicable travel time and solves የሃበሻ ቀጠሮ at least in Addis hehe	571
20	+6 Summer is here btw	594

مشاهده همه پست‌ها