Unique Subsets

最后更新于:2022-04-02 01:12:04

# Unique Subsets ### Source - leetcode: [Subsets II | LeetCode OJ](https://leetcode.com/problems/subsets-ii/) - lintcode: [(18) Unique Subsets](http://www.lintcode.com/en/problem/unique-subsets/) ### Problem Given a list of numbers that may has duplicate numbers, return all possible subsets. #### Example If ***S*** = `[1,2,2]`, a solution is: ~~~ [ [2], [1], [1,2,2], [2,2], [1,2], [] ] ~~~ #### Note Each element in a subset must be in **non-descending **order.The ordering between two subsets is free.The solution set must not contain duplicate subsets. ### 题解 此题在上一题的基础上加了有重复元素的情况,因此需要对回溯函数进行一定的剪枝,对于排列组合的模板程序,剪枝通常可以从两个地方出发,一是在返回结果`result.add`之前进行剪枝,另一个则是在`list.add`处剪枝,具体使用哪一种需要视情况而定,哪种简单就选谁。 由于此题所给数组不一定有序,故首先需要排序。有重复元素对最终结果的影响在于重复元素最多只能出现`n`次(重复个数为n时)。具体分析过程如下(此分析过程改编自 [九章算法](http://www.jiuzhang.com))。 以 [1,21,22][1, 2_1, 2_2][1,21,22] 为例,若不考虑重复,组合有 [],[1],[1,21],[1,21,22],[1,22],[21],[21,22],[22][], [1], [1, 2_1], [1, 2_1, 2_2], [1, 2_2], [2_1], [2_1, 2_2], [2_2][],[1],[1,21],[1,21,22],[1,22],[21],[21,22],[22]. 其中重复的有 [1,22],[22][1, 2_2], [2_2][1,22],[22]. 从中我们可以看出只能从重复元素的第一个持续往下添加到列表中,而不能取第二个或之后的重复元素。参考上一题Subsets的模板,能代表「重复元素的第一个」即为 for 循环中的`pos`变量,`i == pos`时,`i`处所代表的变量即为某一层遍历中得「第一个元素」,因此去重时只需判断`i != pos && s[i] == s[i - 1]`(不是 i + 1, 可能索引越界,而i 不等于 pos 已经能保证 i >= 1). ### C++ ~~~ class Solution { public: /** * @param S: A set of numbers. * @return: A list of lists. All valid subsets. */ vector > subsetsWithDup(const vector &S) { vector > result; if (S.empty()) { return result; } vector list; vector source(S); sort(source.begin(), source.end()); backtrack(result, list, source, 0); return result; } private: void backtrack(vector > &ret, vector &list, vector &s, int pos) { ret.push_back(list); for (int i = pos; i != s.size(); ++i) { if (i != pos && s[i] == s[i - 1]) { continue; } list.push_back(s[i]); backtrack(ret, list, s, i + 1); list.pop_back(); } } }; ~~~ ### Java ~~~ class Solution { /** * @param S: A set of numbers. * @return: A list of lists. All valid subsets. */ public ArrayList> subsetsWithDup(ArrayList S) { ArrayList> result = new ArrayList>(); if (S == null) return result; // Collections.sort(S); List list = new ArrayList(); dfs(S, 0, list, result); return result; } private void dfs(ArrayList S, int pos, List list, ArrayList> result) { result.add(new ArrayList(list)); for (int i = pos; i < S.size(); i++) { // exlude duplicate if (i != pos && S.get(i) == S.get(i - 1)) { continue; } list.add(S.get(i)); dfs(S, i + 1, list, result); list.remove(list.size() - 1); } } } ~~~ ### 源码分析 相比前一道题多了去重的判断。 ### 复杂度分析 和前一道题差不多,最坏情况下时间复杂度为 2n2^n2n. 空间复杂度为 O(n)O(n)O(n). ### Reference - [Subsets II | 九章算法](http://www.jiuzhang.com/solutions/subsets-ii/)
';