ByteByteGo 02月02日
EP148: DeepSeek 1-Pager
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文概述了多个技术主题,包括DeepSeek AI模型的突破性进展,其通过Group Relative Policy Optimization技术在推理效率上超越传统模型,并以低成本实现高性能。文章还探讨了软件开发中常用的工具和资源,如Git和GitHub的版本控制差异,以及改善数据库性能的关键策略。此外,文章还列举了18个常见的网络端口及其用途,为读者提供了全面的技术知识概览。同时还推荐了多本经典的软件开发书籍,以及如何利用AI工具提高工作效率。

🚀 DeepSeek R1模型采用混合专家架构,以低成本实现高效推理,其训练成本远低于其他模型,且在数学、编码和推理任务中表现出色。

🗂️ Git专注于本地版本控制,而GitHub是云端代码托管平台,两者互补,共同支持高效的软件开发流程,Git无需网络,GitHub需要网络。

💽 数据库性能优化策略包括索引加速查询、分片和分区管理数据、反规范化减少连接开销、复制提高读取性能以及使用锁定技术管理并发。

🤖 AI Agent 是一种软件程序,可以在没有持续人工干预的情况下自主行动,它能够感知环境,存储知识,并使用工具完成预定目标,能够与其它AI Agent或者人类协作。

📚 文章还推荐了多本经典的软件开发书籍,涵盖了从代码编写到系统架构的各个方面,为开发者提供了宝贵的学习资源。

Build the API your users deserve (Sponsored)

Creating a best in class API experience for your partners and customers shouldn't require an army of engineers. Speakeasy’s platform automates the generation of idiomatic SDKs, custom-branded docs, and robust test suites that are as good as what you'd build yourself. That means you can unlock API revenue while keeping your team focused on what matters most: shipping new products. All you need to get started is your OpenAPI document.

Try Speakeasy for free


This week’s system design refresher:


DeepSeek 1-Pager

It is said to have developed a powerful AI model at a remarkably low cost, approximately $6 million for the final training run. In January 2025, it is said to have released its latest reasoning-focused model known as DeepSeek R1.

The release made it the No. 1 downloaded free app on the Apple Play Store.

Most AI models are trained using supervised fine-tuning, meaning they learn by mimicking large datasets of human-annotated examples. This method has limitations.

DeepSeek R1 overcomes these limitations by using Group Relative Policy Optimization (GRPO), a reinforcement learning technique that improves reasoning efficiency by comparing multiple possible answers within the same context.

Some facts about DeepSeek’s R1 model are as follows:

    DeepSeek-R1 uses a Mixture-of-Experts (MoE) architecture with 671 billion total parameters, activating only 37 billion parameters per task.

    It employs selective parameter activation through MoE for resource optimization.

    The model is pre-trained on 14.8 trillion tokens across 52 languages.

    DeepSeek-R1 was trained using just 2000 Nvidia GPUs. By comparison, ChatGPT-4 needed approximately 25K Nvidia GPUs over 90-100 days.

    The model is 85-90% more cost-effective than competitors.

    It excels in mathematics, coding, and reasoning tasks.

    Also, the model has been released as open-source under the MIT license.

Over to you: Have you tried DeepSeek?

Download the high-resolution image here.


My Favorite 10 Books for Software Developers

General Advice

    The Pragmatic Programmer by Andrew Hunt and David Thomas

    Code Complete by Steve McConnell: Often considered a bible for software developers, this comprehensive book covers all aspects of software development, from design and coding to testing and maintenance.

Coding

    Clean Code by Robert C. Martin

    Refactoring by Martin Fowler

Software Architecture

    Designing Data-Intensive Applications by Martin Kleppmann

    System Design Interview (our own book :))

Design Patterns

    Design Patterns by Eric Gamma and Others

    Domain-Driven Design by Eric Evans

Data Structures and Algorithms

    Introduction to Algorithms by Cormen, Leiserson, Rivest, and Stein

    Cracking the Coding Interview by Gayle Laakmann McDowell

Over to you: What is your favorite book?


How I use 20+ AI models in one app

(PRESENTED BY YOU.COM)

I routinely have ChatGPT, Claude, and DeepSeek open side-by-side because each model excels at tasks that the others don't.

That’s why I like using You.com, the tool that combines the most popular AI models in one app:

Ends soon: Access 12 months of Pro at no cost ($180 value). Just visit the offer page to redeem your special offer as a ByteByteGo newsletter subscriber.


What is an AI Agent?

An AI agent is a software program that can interact with its environment, gather data, and use that data to achieve predetermined goals. AI agents can choose the best actions to perform to meet those goals.

Key characteristics of AI agents are as follows:

An agent can perform autonomous actions without constant human intervention. Also, they can have a human in the loop to maintain control.

Multiple types of AI agents are available such as learning agents, simple reflex agents, model-based reflex agents, goal-based agents, and utility-based agents.

A system with AI agents can be built with different architectural approaches.

    Single Agent: Agents can serve as personal assistants.

    Multi-Agent: Agents can interact with each other in collaborative or competitive ways.

    Human Machine: Agents can interact with humans to execute tasks more efficiently.

Over to you: Have you used AI Agents?


Git vs GitHub

Git and GitHub are popular tools for version control. They work together and complement each other to provide effective source control management.

On a high level, Git is focused on version control and code sharing, whereas GitHub is focused on centralized source code hosting for sharing with other developers.

However, they have some key differences

    Git is a free, open-source version control tool. GitHub is a cloud-based, pay-for-use service that runs Git in the cloud.

    Git is installed locally on a developer’s machine. GitHub is hosted in the cloud.

    The Linux Foundation maintains Git. Microsoft owns GitHub.

    Git can manage different versions of edits, made to files in a git repository. GitHub is a space to upload a copy of the Git repository.

    Git supports version control and source code management. GitHub can be used for hosting code, collaboration, and project management.

    Git has minimal external tool configuration. GitHub provides an active marketplace for tool integration.

Lastly, you can use Git without GitHub but you cannot use GitHub without Git.

Over to you: What else will you add to distinguish between Git and GitHub?


A Cheatsheet on Database Performance

Good database performance is critical since it directly impacts user experience, operational costs, and scalability.

But what impacts database performance?

Evaluating database performance depends on key metrics such as query execution time, throughput, latency, and resource utilization.
Workload Types such as write-heavy, read-heavy, delete-heavy, and competing workloads pose unique challenges.

Other factors that impact performance are item size, item type, dataset size, concurrency expectations, consistency requirements, HA expectations, and geographic distribution.

Multiple strategies exist to improve database performance. Some of the most important ones are as follows:

    Database Indexing
    Indexes are important for speeding up database queries by reducing the amount of data scanned. Also, choosing the right index type is crucial.

    Sharding and Partitioning
    Divide the data into smaller, more manageable chunks known as shards. Each shard is also stored on a different server.

    Denormalization
    Denormalization combines data into fewer tables to reduce the overhead of joins, improving read performance.

    Database Replication
    Replication involves maintaining multiple copies of the same database, typically with a primary node for writes (and critical reads) and secondary nodes for most read operations.

    Database Locking Techniques
    Use locking techniques like pessimistic and optimistic locking to manage concurrency levels and resource contention.

Over to you: Which other database performance strategy will you add to the list?


18 Common Ports Worth Knowing

    FTP (File Transfer Protocol): Uses TCP Port 21

    SSH (Secure Sheel for Login): Uses TCP Port 22

    Telnet: Uses TCP Port 23 for remote login

    SMTP (Simple Mail Transfer Protocol): Uses TCP Port 25

    DNS: Uses UDP or TCP on Port 53 for DNS queries

    DHCP Server: Uses UDP Port 67

    DHCTP Client: Uses UDP Port 68

    HTTP (Hypertext Transfer Protocol): Uses TCP Port 80

    POP3 (Post Office Protocol V3): Uses TCP Port 110

    NTP (Network Time Protocol): Uses UDP Port 123

    NetBIOS: Uses TCP Port 139 for NetBIOS service

    IMAP (Internet Message Access Protocol): Uses TCP Port 139

    HTTPS (Secure HTTP): Uses TCP Port 443

    SMB (Server Message Block): Uses TCP Port 445

    Oracle DB: Uses TCP Port 1521 for Oracle database communication port

    MySQL: Uses TCP Port 3360 for MySQL database communication port

    RDP: Uses TCP Port 3389 for Remote Desktop Protocol

    PostgreSQL: Uses TCP Port 5432 for PostgreSQL database communication

Over to you: Which other port will you add to the list?


SPONSOR US

Get your product in front of more than 1,000,000 tech professionals.

Our newsletter puts your products and services directly in front of an audience that matters - hundreds of thousands of engineering leaders and senior engineers - who have influence over significant tech decisions and big purchases.

Space Fills Up Fast - Reserve Today

Ad spots typically sell out about 4 weeks in advance. To ensure your ad reaches this influential audience, reserve your space now by emailing sponsorship@bytebytego.com.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

DeepSeek AI模型 数据库性能 版本控制 网络端口
相关文章