Society's Backend 前天 21:43
ML Jobs, Resources, and Content for Software Engineers #14: How do we combat cognitive decline?
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本周的《软件工程师机器学习周刊》聚焦于AI领域的最新发展,包括Meta公司在超级智能团队的招聘、Anthropic的实验结果、Airtable推出AI应用创建助手、以及谷歌DeepMind的Gemma 3n模型发布等。文章还探讨了AI对教育的影响、自驾技术的进步,以及工程师如何通过学习和实践提升自身技能。此外,还提供了关于英伟达Blackwell GPU、vLLM V1推理效率,以及AI市场动态的深度分析,为工程师们提供了丰富的学习资源和行业洞察。

🧠Meta公司正在积极招聘AI人才,组建超级智能团队,开出高薪和丰厚签约奖金,引发了行业内的关注。

🤖Anthropic公司进行的实验表明,AI在某些任务上表现出色,但在经济效益方面仍有局限性,尤其是在没有人类监督的情况下。

💡Airtable推出AI应用创建助手,用户可以通过该助手构建应用,并利用AI与网络上的各种组件进行交互,这预示着AI将改变互联网信息交互方式。

🚀谷歌DeepMind发布了Gemma 3n模型,该模型在边缘设备上也能运行强大的多模态AI,且在LMArena中表现出色,仅次于Gemini 1.5 Pro。

🚗特斯拉完成了首次完全无人驾驶的车辆交付,展示了自动驾驶技术的实际应用和潜在优势,引发了人们对未来出行的思考。

Welcome to the weekly Machine Learning for Software Engineers newsletter! Each week I curate an AI reading list specifically for software engineers. What does that mean? It means this weekly newsletter contains everything software engineers should know about AI:

I try to make these dense so they’re as beneficial for you as possible. Subscribe to get these in your inbox each week.

Subscribe now

If you read these each week, welcome back! You’ll notice I’ve included a jobs section this week. I plan on doing this each week to give y’all a general overview of available positions. I’ve got a few articles upcoming that I’m really excited about. Stay tuned for:

Enjoy the weekly update!


I ran into an interesting post this past week I want to share with y’all:

Like all transformative technology (think: the internet, video games, video streaming, etc.), we’ll need to determine our relationship with AI. We’ve already seen a reliance on LLMs decline cognitive ability and I honestly fear for the downsides of AI reliance especially in young kids that are going through their schooling with it.

The most important thing I was taught in school was how to think. In my opinion, that is the purpose of an education. You absolutely should learn basic information needed for life, but you should also be taught to think for yourself and how to deduce more information based on the information given to you. If you haven’t learned this, your education has failed you.

This learned skill is like a muscle: The more you use it, the better it gets. If you stop using it, it atrophies. Just like working out to grow muscle, the process to develop this skill requires a struggle.

Technological advancement has necessitated physical workouts to keep our bodies in healthy shape as jobs have shifted to be more stagnant and less physical. We created gyms and other sources of exercise to overcome this stagnation, yet so many people fail to utilize them and we’ve seen the impact that has had on the health of our populace. I argue that an atrophy of the same level, but in our minds, would be much more detrimental to the populace.

So the question is: What’s the mental equivalent of a gym and how do we ensure it’s effective? AI isn’t going anywhere and there’s no doubt it’ll be economically transformative, but what should our relationship with it look like?

Let me know what you think in the comments.


If you find ML for SWEs helpful, consider becoming a paid subscriber (and get all the articles above!). It’s still just $3/mo, but will go up to $5/mo after 10 more subscriptions.

Get 40% off forever


Important things that happened last week

Zuck is recruiting ruthlessly for his Superintelligence team at Meta (to be called the Meta Superintelligence Lab or MSL). Mark Zuckerberg is personally recruiting high-profile AI talent, much of which has come from OpenAI. He’s reportedly offering 7 figure+ salaries and generous signing bonuses (not $100M like many claim, but still high).

This initially prompted Sam Altman to say Meta isn’t creating a great culture by poaching talent and that none of their really impressive talent had left. Less than a week later, OpenAI released a statement saying they will be reviewing their employees’ compensation packages to ensure they keep talent.


Anthropic had Claude run a vending machine and it was wildly unprofitable. Anthropic let “Claudius” control the shop w/o human oversight. I suggest reading Anthropic’s report above for all of Claudius’s intricacies. The outcome: AI is great for certain tasks (research, web browsing, text generation, etc.) but not economically competent on its own (in Claude’s current state, at least).

Claudius’s profit over time. Source.

Airtable has released an AI app-creating assistant. The Airtable CEO believes app creation is the new interface for which AI will interact with components around the web. Airtable’s new assistant allows users to create apps and interfaces they want and use Omni (the assistant) to interact with those interfaces and the data they contain. AI will fundamentally change the way information on the internet will be interacted with and this is a good look at how.


A court ruled in favor of Anthropic and Meta purchasing copyrighted material and using it to train their AI (specifically books, but it will likely extend to other mediums). The court ruled this as ‘fair use’. This will not only settle other lawsuits still in progress, but its huge for generative AI companies looking to acquire training data for their models AND has implications for authors and creators about how they’ll publish their creations.


Google DeepMind introduces Gemma 3n. This new open model brings powerful, multimodal AI to edge devices, meaning it can run within <3GB of memory. It’s the highest scoring model of its class in LMArena coming in right under Gemini 1.5 Pro. Google credits this advancement to Gemma’s mobile-first architecture which you can read more about at the link above.


PadChest-GR: A new benchmark for testing biomedical imaging models. We’ve already seen AI outperform radiologists at diagnosing disease by interpreting medical images, but AI is still seen as a copilot because the cost of a false negative in this domain too high to risk. This benchmark aims to accurately evaluate the performance of these models in the real world.


DeepMind releases AlphaGenome to predict how DNA variants impact gene regulation. AlphaGenome is available via API and aims to accelerate scientific understanding of disease and drive the discovery of new treatments. I’m not going to pretend I fully understand AlphaGenome’s applications, but this is an incredible application for AI (much better than most LLM applications we’ve seen so far).


Learn a language by chatting with a voice AI assistant. The best way to learn a language is by speaking it. Previously, this meant immersion, but now we have AI assistants that can speak the language and direct learning as the user needs. You can try it out yourself at the link above. It still needs tweaking, but it shows the potential of this application.


Anthropic allows for building and sharing interactive apps directly in Claude. Users can create applications in English and share them with others. Claude Artifacts acts as a personal app gallery with each app operating in its own sandboxed environment.


Apple is thinking of using Claude or ChatGPT to power Siri. This is a huge diversion from Apple’s usual ‘built in house’ mantra. They have a tight control over their hardware and software and using an external AI assistant relinquishes some of this control. This could be the end of an era for Apple and it shows how their AI blunder is having a very real impact.


Tesla delivered their first vehicle completely operator-free. This means there was no one in the car as it drove from Tesla’s factory to the recipient in Austin. I love this because it’s a concrete example of the benefit of self-driving technology. Imagine every car being delivered directly to the recipient. The time-savings of not have to be at a dealership all day will alone double the US GDP.

Humor aside, I don’t think most people realize the benefit of self-driving cars. There’s a lot of fear in this area because mistakes are costly, but the data shows just how much safer self-driving cars are than humans. Self-driving cars means lives saved and less time wasted commuting. You can see the video of Tesla’s delivery below.


If you want a more complete overview of AI this week, read ’s weekly roundup. His is one of few overviews I read each week:

Artificial Ignorance
AI Roundup 124: $uperintelligence
$uperintelligence…
Read more

Resources so you can become a better engineer

Blackwell: Nvidia's GPU

Nvidia's Blackwell GPUs achieve immense scale by prioritizing Streaming Multiprocessor density. The GB202 die is massive, featuring 92.2 billion transistors and 192 Streaming Multiprocessors. Its 1:16 SM to GPC ratio allows high compute density, though it can challenge feeding short-duration tasks. Blackwell also improves workload overlap compared to previous generations.

Life of an inference request (vLLM V1): How LLMs are served efficiently at scale

Optimize large language model inference at scale with vLLM V1. Ubicloud leverages vLLM to deploy and manage large language models across GPUs. It efficiently load balances incoming requests, ensuring scalable and robust service. The article details the path of an inference request through vLLM's OpenAI-compatible API server and core engine.

Why "AI Hate" is Your Next Billion-Dollar Opportunity [Markets] by

Turn AI dissent into your next billion-dollar investment. The growing negative sentiment towards Artificial Intelligence, driven by real concerns and public distrust, creates a unique contrarian market opportunity. This "AI hate" acts as an essential antithesis to pervasive AI hype, leading to a synthesis that yields more robust and widely adoptable technologies.

The Google for Startups Gemini kit is here

The Google for Startups Gemini Kit empowers entrepreneurs to build faster using AI. This new, free suite provides essential tools, credits, and support for integrating AI into products. It streamlines the adoption process for startups at any stage.

Watch the full video here:


Collaborative filtering deep dive

A guide/Jupyter notebook to get in-depth with collaborative filtering, an algorithm commonly used by recommendation engines (think video or song recommendations that would be used by companies like Netflix or Spotify). I’ll be writing more about this (and introducing it via code) in an upcoming article, so stay tuned!

This week’s jobs

Here are some interesting opening this week. I also post these on my socials, so make sure to follow me on X and LinkedIn. I’ll develop this section as time goes on to include more and more relevant jobs.

If you want to learn more about the current job market, support ML for SWEs as a paid subscriber and you’ll get less frequent, but much more comprehensive AI job market overviews.


1. ML/AI Engineer CAD Infrastructure - Tenstorrent (Remote)

Overview: Develop ML-powered systems to automate post-silicon validation, debug, and root cause analysis for semiconductors. This role builds and maintains the infrastructure for large-scale silicon testing and creates predictive models to analyze behavior, optimize performance, and detect anomalies.

Key Skills/Requirements:

2. Machine Learning Engineer Graduate - TikTok (San Jose, CA)

Overview: Participate in the development of large-scale Ads systems and next-generation monetization platforms. Responsibilities include developing state-of-the-art applied machine learning projects, owning key targeting components, and working with product teams on product vision.

Key Skills/Requirements:


3. Quantitative Researcher Machine Learning - Two Sigma (New York, United States)

Overview: Use a rigorous scientific method to develop sophisticated trading models and shape insights into how markets will behave. Apply machine learning to vast datasets to create and test complex investment ideas for trading a variety of global markets.

Key Skills/Requirements:


4. Senior Engineer AI Research - Qualcomm (San Diego, CA)

Overview: Conduct fundamental and applied research in machine learning, creating new models and training methods in areas like generative AI, LLMs, and reinforcement learning. Responsibilities also include developing and optimizing software, tools, and compilers to deploy efficient machine learning solutions on resource-constrained devices across mobile, automotive, and IOT sectors.

Key Skills/Requirements:


5. Early Career Machine Learning Engineer - NFX (San Francisco Bay Area)

Overview: Develop and deploy machine learning, NLP, and generative AI models that power the Piai™ claims-intelligence platform. You will turn raw legal and medical data into production-ready models that directly improve justice for personal-injury clients by helping law firms secure better outcomes.

Key Skills/Requirements:


Thanks for reading!

Always be (machine) learning,

Logan

Share

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI 机器学习 软件工程 行业动态
相关文章