Java后端系统中处理美团API分页数据的游标式遍历与内存优化策略
在对接美团开放平台时,常需同步大量业务数据(如订单、门店、商品),其API通常采用基于游标的分页机制(如 next_cursor)。若采用传统全量加载或不当的分页循环,极易导致内存溢出(OOM)或频繁GC。本文通过游标迭代器 + 流式处理 + 内存复用设计,实现高效、低内存占用的数据拉取。
1. 美团API分页响应结构建模
美团多数列表接口返回如下结构:
{
"code": 0,
"data": {
"list": [...],
"has_next": true,
"next_cursor": "MTAwMA=="
}
}
对应Java模型:
package baodanbao.com.cn.meituan.model;
import java.util.List;
public class MeituanPageResponse<T> {
private int code;
private DataWrapper<T> data;
public static class DataWrapper<T> {
private List<T> list;
private boolean hasNext;
private String nextCursor;
// getters
public List<T> getList() { return list; }
public boolean isHasNext() { return hasNext; }
public String getNextCursor() { return nextCursor; }
}
// getters
public DataWrapper<T> getData() { return data; }
}

2. 游标式迭代器实现
封装一个 CursorIterator,按需拉取下一页,避免一次性加载:
package baodanbao.com.cn.meituan.iterator;
import baodanbao.com.cn.meituan.client.MeituanApiClient;
import baodanbao.com.cn.meituan.model.MeituanPageResponse;
import baodanbao.com.cn.meituan.model.Order;
import java.util.Iterator;
import java.util.NoSuchElementException;
public class OrderCursorIterator implements Iterator<Order> {
private final MeituanApiClient apiClient;
private String currentCursor = "0";
private Iterator<Order> currentPageIterator = null;
private boolean hasNextPage = true;
public OrderCursorIterator(MeituanApiClient apiClient) {
this.apiClient = apiClient;
}
@Override
public boolean hasNext() {
if (currentPageIterator != null && currentPageIterator.hasNext()) {
return true;
}
if (!hasNextPage) return false;
loadNextPage();
return currentPageIterator != null && currentPageIterator.hasNext();
}
@Override
public Order next() {
if (!hasNext()) {
throw new NoSuchElementException();
}
return currentPageIterator.next();
}
private void loadNextPage() {
MeituanPageResponse<Order> response = apiClient.queryOrders(currentCursor);
if (response == null || response.getData() == null) {
hasNextPage = false;
currentPageIterator = null;
return;
}
var data = response.getData();
currentCursor = data.getNextCursor();
hasNextPage = data.isHasNext();
currentPageIterator = data.getList().iterator();
}
}
3. 内存优化:对象复用与流式处理
为避免 List<Order> 频繁分配,可结合 Stream 和消费者模式逐条处理:
package baodanbao.com.cn.meituan.service;
import baodanbao.com.cn.meituan.client.MeituanApiClient;
import baodanbao.com.cn.meituan.iterator.OrderCursorIterator;
import baodanbao.com.cn.meituan.model.Order;
import java.util.function.Consumer;
public class OrderSyncService {
private final MeituanApiClient apiClient;
public OrderSyncService(MeituanApiClient apiClient) {
this.apiClient = apiClient;
}
public void syncAllOrders(Consumer<Order> processor) {
OrderCursorIterator iterator = new OrderCursorIterator(apiClient);
while (iterator.hasNext()) {
Order order = iterator.next();
// 复用 order 对象?不可行(JSON反序列化新建)
// 但可确保 processor 不持有引用
processor.accept(order);
// 显式置空(非必需,但强调无引用残留)
order = null;
}
}
}
4. HTTP客户端与反序列化优化
使用 OkHttp + Jackson 并配置复用连接池与对象映射器:
package baodanbao.com.cn.meituan.client;
import baodanbao.com.cn.meituan.model.MeituanPageResponse;
import baodanbao.com.cn.meituan.model.Order;
import com.fasterxml.jackson.databind.ObjectMapper;
import okhttp3.*;
import java.io.IOException;
public class MeituanApiClient {
private static final OkHttpClient HTTP_CLIENT = new OkHttpClient.Builder()
.connectionPool(new ConnectionPool(10, 5, java.util.concurrent.TimeUnit.MINUTES))
.build();
private static final ObjectMapper MAPPER = new ObjectMapper();
private static final MediaType JSON = MediaType.get("application/json; charset=utf-8");
public MeituanPageResponse<Order> queryOrders(String cursor) {
String jsonBody = "{\"app_id\":\"YOUR_APP\",\"cursor\":\"" + cursor + "\"}";
RequestBody body = RequestBody.create(jsonBody, JSON);
Request request = new Request.Builder()
.url("https://openapi.meituan.com/v1/order/list")
.post(body)
.addHeader("Authorization", "Bearer YOUR_TOKEN")
.build();
try (Response response = HTTP_CLIENT.newCall(request).execute()) {
if (!response.isSuccessful()) throw new RuntimeException("API call failed");
// 直接反序列化泛型类型,避免中间字符串
return MAPPER.readValue(response.body().byteStream(),
MAPPER.getTypeFactory().constructParametricType(
MeituanPageResponse.class, Order.class));
} catch (IOException e) {
throw new RuntimeException("Failed to fetch orders", e);
}
}
}
5. 使用示例:批量入库但控制批次大小
即使使用游标,若业务逻辑需批量操作(如每100条插入DB),仍需控制局部集合大小:
package baodanbao.com.cn.meituan.app;
import baodanbao.com.cn.meituan.client.MeituanApiClient;
import baodanbao.com.cn.meituan.model.Order;
import baodanbao.com.cn.meituan.service.OrderSyncService;
import java.util.ArrayList;
import java.util.List;
public class SyncRunner {
public static void main(String[] args) {
MeituanApiClient client = new MeituanApiClient();
OrderSyncService service = new OrderSyncService(client);
List<Order> batch = new ArrayList<>(100);
service.syncAllOrders(order -> {
batch.add(order);
if (batch.size() >= 100) {
saveBatchToDatabase(batch);
batch.clear(); // 关键:清空引用,释放内存
}
});
// 处理剩余
if (!batch.isEmpty()) {
saveBatchToDatabase(batch);
}
}
private static void saveBatchToDatabase(List<Order> batch) {
// 模拟DB写入
System.out.println("Saving " + batch.size() + " orders");
}
}
6. 监控与异常恢复
在生产环境中,应记录最后成功游标,支持断点续传:
// 在 processor 中捕获 lastCursor 成功处理位置
// 存入 Redis 或 DB: SET meituan_last_cursor_order_sync <cursor>
// 下次启动时从该 cursor 继续
本文著作权归吃喝不愁app开发者团队,转载请注明出处!
4267

被折叠的 条评论
为什么被折叠?



