LeetCode 187. Repeated DNA Sequences

本文介绍了一个函数,用于在DNA分子中查找所有长度为10的重复序列子串。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

#include <unordered_map>
#include <vector>
#include <iostream>
using namespace std;

/*
   All DNA is composed of a series of nucleotides abbreviated as A, C, G, and T, for example: "ACGAATTCCG". When studying DNA, it is sometimes useful to identify repeated sequences within the DNA.

Write a function to find all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule.

  Given s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT",

  Return:
  ["AAAAACCCCC", "CCCCCAAAAA"].
*/

vector<string> findRepeatedDnaSequences(string s) {
  if(s.size() < 10) return {};
  vector<string> res;
  unordered_map<string, int> indexToString;
  for(int i = 0; i <= s.size() - 10; ++i) {
    string tmp = s.substr(i, 10);
    auto iter = indexToString.find(tmp);
    if(iter == indexToString.end()) indexToString.insert({tmp, 1});
    else iter->second += 1;
  }
  auto iter = indexToString.begin();
  while(iter != indexToString.end()) {
    if(iter->second > 1) res.push_back(iter->first);
    iter++;
  }
  return res;
}

int main(void) {
  vector<string> res = findRepeatedDnaSequences("AAAAAAAAAAA");
  for(int i = 0; i < res.size(); ++i) {
    cout << res[i] << endl;
  }
}

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值