ByteByteGo 前天 23:33
EP169: RAG vs Agentic RAG
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本周的技术周刊涵盖了多个热门领域,包括通过自动化QA加速移动应用发布、深度探讨RAG与Agentic RAG的区别,以及Kubernetes的基础知识。此外,文章还介绍了6种节省存储的数据结构和5种数据库规范化形式。最后,周刊还分享了高薪技术职位招聘信息,并推荐了AI代码助手效率衡量方法。总而言之,这是一份面向技术专业人士,内容丰富、实用性强的技术周刊。

📱 移动应用自动化测试: QA Wolf 提供Appium测试套件,实现100%并行自动化测试,将测试周期缩短至15分钟以内,并在真实设备和模拟器上运行测试,实现无错误运行。

💡 RAG vs Agentic RAG: RAG(检索增强生成)结合信息检索与大语言模型生成答案。Agentic RAG引入AI智能体,能够做出决策、选择工具甚至优化查询,从而获得更准确、灵活的响应。

🐳 Kubernetes 基础知识: Kubernetes是一个开源容器编排平台,开发者通过创建manifest文件定义应用。Kubernetes通过控制平面和节点运行,常用资源包括 Pods、Deployments、Services 和 Persistent Volumes等。

💾 存储数据结构: Bloom Filter用于测试元素是否属于集合;HyperLogLog估算多重集合中唯一元素的数量;Cuckoo Filter是Bloom Filter的替代方案,支持删除并具有更好的查找性能;Minhash用于快速估计大型集合之间的相似性;SkipList是一种快速搜索、插入和删除操作的链表结构;Count-Min Sketch用于估算大数据流中项目的频率。

🗄️ 数据库规范化: 数据库规范化旨在消除冗余并确保数据完整性,包括第一范式(1NF)、第二范式(2NF)、第三范式(3NF)和BCNF,以及第四范式(4NF)。

🚀 Faster mobile app releases with 100% parallel automated QA (Sponsored)

Manual testing on personal devices is too slow and too limited. It forces teams to cut releases a week early just to test before submitting them to app stores. And without broad device coverage, issues slip through.

QA Wolf gets mobile apps to 80% automated test coverage in weeks. They create and maintain your test suite in Appium (no vendor lock-in) — and provide unlimited, 100% parallel test runs with zero flakes.

✅ QA cycles reduced from hours to less than 15 minutes

✅ Tests run on real iOS devices and Android emulators

✅ Flake-free runs, no false positives

✅ Human-verified bug reports

No more manual E2E testing. No more slow QA cycles. No more bugs reaching production.

Rated 4.8/5 ⭐ on G2

Schedule a demo to learn more


This week’s system design refresher:


7 System Design Concepts Explained in 10 Minutes


Measuring the productivity gains from AI code assistants— July 17th (Sponsored)

To navigate the AI era, leaders need metrics on the efficiency of AI code tools and agents. Join this discussion with Abi Noda and Laura Tacho, CEO and CTO at DX, to learn how to apply the AI Measurement Framework to track AI adoption, measure impact, and make smarter investments.

Join this discussion to learn:

Register to join


RAG vs Agentic RAG

RAG (Retrieval Augmented Generation) is a method that combines information retrieval with large language models to generate answers. Here’s how RAG works on a high level:

    The model retrieves relevant data from data sources and then extracts it to a vector database from the pre-indexed model.

    Augment the prompts by retrieving information and merging it with the query prompt.

    A Large Language Model (like GPT, Claude, or Gemini) understands the combined query and generates the final response.

A traditional RAG has a simple retrieval, limited adaptability, and relies on static knowledge, making it less flexible for dynamic and real-time information.

Agentic RAG improves on this by introducing AI agents that can make decisions, select tools, and even refine queries for more accurate and flexible responses. Here’s how Agentic RAG works on a high level:

    The user query is directed to an AI Agent for processing.

    The agent uses short-term and long-term memory to track query context. It also formulates a retrieval strategy and selects appropriate tools for the job.

    The data fetching process can use tools such as vector search, multiple agents, and MCP servers to gather relevant data from the knowledge base.

    The agent then combines retrieved data with a query and system prompt. It passes this data to the LLM.

    LLM processes the optimized input to answer the user’s query.

Over to you: What else will you add to better understand RAG vs Agentic RAG?


A Cheatsheet on Kubernetes

Kubernetes (K8S) is an open-source container orchestration platform originally developed by Google and now maintained by the Cloud Native Computing Foundation (CNCF).

Developers working with Kubernetes create manifest files that describe their application and specify the number of instances, resource requirements, and other configurations.

Kubernetes operates using a control pane and a group of nodes.
The control plane is deployed on the master node and manages the overall state of the cluster. It consists of components like API Server, Etcd, Controller Manager, and Scheduler.

Nodes are the workers in a Kubernetes cluster. Each node contains components like the Kubelet and Kube-proxy and is responsible for running the containerized applications.

Some top Kubernetes resources are Pods, Deployments, Services, and Persistent Volumes.

    Pods encapsulate one or more containers.

    Deployment is a higher-level abstraction that manages Pods.

    Service is an abstraction that provides a stable way to expose and access a set of Pods.

    Volume is a storage resource that helps retain data between pod restarts or recreations.

The Horizontal Pod Autoscaler helps scale the number of Pods in a Deployment, ReplicaSet, or StatefulSet based on observed CPU utilization, memory usage, or custom metrics. It monitors resource usage and adjusts the number of replicas to match the desired resource targets.

Over to you: Have you used Kubernetes in your projects?


6 Data Structures to Save Storage

    Bloom Filter
    A probabilistic data structure used to test whether an element is a member of a set.

    HyperLogLog
    An algorithm that approximates the number of unique elements in a multi-set using minimal memory.

    Cuckoo Filter
    A space-efficient alternative to Bloom filters that supports deletion and has better lookup performance.

    Minhash
    A technique for quickly estimating the similarity between large sets using compressed hash signatures.

    SkipList
    A layered linked list structure that allows fast search, insert, and delete operations.

    Count-Min Sketch
    A probabilistic data structure that approximates the frequency of items in a large data stream

Over to you: Which other data structure will you add to the list?


5 Database Normal Forms Every Developer Should Know

Normalization aims to eliminate redundancy and enforce data integrity by organizing data into logical, dependency-driven forms.

    First Normal Form (1NF): Removes repeating groups and ensures atomic values in each column.

    Second Normal Form (2NF): Removes partial dependencies by ensuring all non-key columns depend on the full primary key.

    Third Normal Form (3NF): Eliminates transitive dependencies, ensuring non-key columns depend only on primary keys.

    Boyce-Codd Normal Form (BCNF): Strengthens 3NF by removing anomalies that exist due to overlapping candidate keys. If one column depends on some other column, then that “other column” must be enough to identify each row uniquely.

    Fourth Normal Form (4NF): Ensures that a table does not mix multiple independent one-to-many relationships for the same entity.

Over to you: Which other Normal Form will you add to the list?


Hiring Now

We collaborate with Jobright.ai (an AI job search copilot trusted by 500K+ tech professionals) to curate this job list.

This Week’s High-Impact Roles at Fast-Growing AI Startups

High Salary SWE Roles this week

Today’s latest ML positions - hiring now!


SPONSOR US

Get your product in front of more than 1,000,000 tech professionals.

Our newsletter puts your products and services directly in front of an audience that matters - hundreds of thousands of engineering leaders and senior engineers - who have influence over significant tech decisions and big purchases.

Space Fills Up Fast - Reserve Today

Ad spots typically sell out about 4 weeks in advance. To ensure your ad reaches this influential audience, reserve your space now by emailing sponsorship@bytebytego.com.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

自动化测试 RAG Kubernetes 数据库规范化 技术周刊
相关文章