From 95bc4e4d54ec506df50fcc2ec6d76be3107ce60d Mon Sep 17 00:00:00 2001
From: "Documenter.jl" <documenter@juliadocs.github.io>
Date: Sun, 24 Mar 2024 16:55:22 +0000
Subject: [PATCH] build based on 5bb8db9

---
 dev/.documenter-siteinfo.json |   2 +-
 dev/index.html                |   2 +-
 dev/kmer_int_repr/index.html  |   2 +-
 dev/objects.inv               | Bin 494 -> 537 bytes
 dev/references/index.html     |   2 +-
 dev/search_index.js           |   2 +-
 6 files changed, 5 insertions(+), 5 deletions(-)
diff --git a/dev/.documenter-siteinfo.json b/dev/.documenter-siteinfo.json
index d518445..1011b9e 100644
--- a/dev/.documenter-siteinfo.json
+++ b/dev/.documenter-siteinfo.json
@@ -1 +1 @@
-{"documenter":{"julia_version":"1.10.2","generation_timestamp":"2024-03-24T02:13:23","documenter_version":"1.3.0"}}
\ No newline at end of file
+{"documenter":{"julia_version":"1.10.2","generation_timestamp":"2024-03-24T16:55:19","documenter_version":"1.3.0"}}
\ No newline at end of file
diff --git a/dev/index.html b/dev/index.html
index e762b8e..7999e8c 100644
--- a/dev/index.html
+++ b/dev/index.html
@@ -26,4 +26,4 @@
  ⋮
  0
  1
- 0</code></pre><h2 id="Limitations"><a class="docs-heading-anchor" href="#Limitations">Limitations</a><a id="Limitations-1"></a><a class="docs-heading-anchor-permalink" href="#Limitations" title="Permalink"></a></h2><p>The main downside of counting <span>$K$</span>-mers this way is that the arrays grow exponentially with respect to <span>$K$</span>. The 31-mer array of a DNA sequence would have a length of <span>$4^{31} = 4,611,686,018,427,387,904$</span>, which is equivalent to four exbibytes of memory, if the values are stored with 8-bit integers — which is just not feasible, really. Not only does allocating a lot of memory take up a lot of memory, but it can also take a substantial amount of time! This method of counting <span>$K$</span>-mers therefore works best for lower <span>$K$</span>-values.</p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="kmer_int_repr/">Integer representation of k-mers »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.3.0 on <span class="colophon-date" title="Sunday 24 March 2024 02:13">Sunday 24 March 2024</span>. Using Julia version 1.10.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+ 0</code></pre><h2 id="Limitations"><a class="docs-heading-anchor" href="#Limitations">Limitations</a><a id="Limitations-1"></a><a class="docs-heading-anchor-permalink" href="#Limitations" title="Permalink"></a></h2><p>The main downside of counting <span>$K$</span>-mers this way is that the arrays grow exponentially with respect to <span>$K$</span>. The 31-mer array of a DNA sequence would have a length of <span>$4^{31} = 4,611,686,018,427,387,904$</span>, which is equivalent to four exbibytes of memory, if the values are stored with 8-bit integers — which is just not feasible, really. Not only does allocating a lot of memory take up a lot of memory, but it can also take a substantial amount of time! This method of counting <span>$K$</span>-mers therefore works best for lower <span>$K$</span>-values.</p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="kmer_int_repr/">Integer representation of k-mers »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.3.0 on <span class="colophon-date" title="Sunday 24 March 2024 16:55">Sunday 24 March 2024</span>. Using Julia version 1.10.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/kmer_int_repr/index.html b/dev/kmer_int_repr/index.html
index 249527d..d2244d3 100644
--- a/dev/kmer_int_repr/index.html
+++ b/dev/kmer_int_repr/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Integer representation of k-mers · VectorizedKmers.jl</title><meta name="title" content="Integer representation of k-mers · VectorizedKmers.jl"/><meta property="og:title" content="Integer representation of k-mers · VectorizedKmers.jl"/><meta property="twitter:title" content="Integer representation of k-mers · VectorizedKmers.jl"/><meta name="description" content="Documentation for VectorizedKmers.jl."/><meta property="og:description" content="Documentation for VectorizedKmers.jl."/><meta property="twitter:description" content="Documentation for VectorizedKmers.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.png" alt="VectorizedKmers.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">VectorizedKmers.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li class="is-active"><a class="tocitem" href>Integer representation of k-mers</a></li><li><a class="tocitem" href="../references/">API Reference</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Integer representation of k-mers</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Integer representation of k-mers</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl/blob/main/docs/src/kmer_int_repr.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Integer-representation-of-K-mers"><a class="docs-heading-anchor" href="#Integer-representation-of-K-mers">Integer representation of K-mers</a><a id="Integer-representation-of-K-mers-1"></a><a class="docs-heading-anchor-permalink" href="#Integer-representation-of-K-mers" title="Permalink"></a></h1><p>This package relies on representing K-mers as integers for indexing.</p><p>For DNA, each non-ambiguous nucleotide is assigned a number between 0 and 3:</p><table><tr><th style="text-align: right">Nucleotide</th><th style="text-align: right">Base-4</th><th style="text-align: right">Base-2</th></tr><tr><td style="text-align: right">A</td><td style="text-align: right">0</td><td style="text-align: right">00</td></tr><tr><td style="text-align: right">C</td><td style="text-align: right">1</td><td style="text-align: right">01</td></tr><tr><td style="text-align: right">G</td><td style="text-align: right">2</td><td style="text-align: right">10</td></tr><tr><td style="text-align: right">T</td><td style="text-align: right">3</td><td style="text-align: right">11</td></tr></table><p>Any ordering works, but this is the one used by <a href="https://github.com/BioJulia/BioSequences.jl">BioSequences.jl</a>. It also has some nice properties, like being in alphabetical order, and that XOR-ing a base with 3 gives you its complement.</p><p>We could theoretically convert any DNA sequence to an integer, but 64-bit unsigned integers limit us to 32-mers.</p><p>Consider the DNA sequence <code>GATTACA</code>. If we convert it to an integer using the table above, we get <span>$2033010_4 = 10001111000100_2 = 9156_{10}$</span>, so the integer value of <code>GATTACA</code> is 9156. Since Julia uses 1-based indexing, we would add 1 to this value to get the index for the value in a vector associated with <code>GATTACA</code>.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Home</a><a class="docs-footer-nextpage" href="../references/">API Reference »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.3.0 on <span class="colophon-date" title="Sunday 24 March 2024 02:13">Sunday 24 March 2024</span>. Using Julia version 1.10.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Integer representation of k-mers · VectorizedKmers.jl</title><meta name="title" content="Integer representation of k-mers · VectorizedKmers.jl"/><meta property="og:title" content="Integer representation of k-mers · VectorizedKmers.jl"/><meta property="twitter:title" content="Integer representation of k-mers · VectorizedKmers.jl"/><meta name="description" content="Documentation for VectorizedKmers.jl."/><meta property="og:description" content="Documentation for VectorizedKmers.jl."/><meta property="twitter:description" content="Documentation for VectorizedKmers.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.png" alt="VectorizedKmers.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">VectorizedKmers.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li class="is-active"><a class="tocitem" href>Integer representation of k-mers</a></li><li><a class="tocitem" href="../references/">API Reference</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Integer representation of k-mers</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Integer representation of k-mers</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl/blob/main/docs/src/kmer_int_repr.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="Integer-representation-of-K-mers"><a class="docs-heading-anchor" href="#Integer-representation-of-K-mers">Integer representation of K-mers</a><a id="Integer-representation-of-K-mers-1"></a><a class="docs-heading-anchor-permalink" href="#Integer-representation-of-K-mers" title="Permalink"></a></h1><p>This package relies on representing K-mers as integers for indexing.</p><p>For DNA, each non-ambiguous nucleotide is assigned a number between 0 and 3:</p><table><tr><th style="text-align: right">Nucleotide</th><th style="text-align: right">Base-4</th><th style="text-align: right">Base-2</th></tr><tr><td style="text-align: right">A</td><td style="text-align: right">0</td><td style="text-align: right">00</td></tr><tr><td style="text-align: right">C</td><td style="text-align: right">1</td><td style="text-align: right">01</td></tr><tr><td style="text-align: right">G</td><td style="text-align: right">2</td><td style="text-align: right">10</td></tr><tr><td style="text-align: right">T</td><td style="text-align: right">3</td><td style="text-align: right">11</td></tr></table><p>Any ordering works, but this is the one used by <a href="https://github.com/BioJulia/BioSequences.jl">BioSequences.jl</a>. It also has some nice properties, like being in alphabetical order, and that XOR-ing a base with 3 gives you its complement.</p><p>We could theoretically convert any DNA sequence to an integer, but 64-bit unsigned integers limit us to 32-mers.</p><p>Consider the DNA sequence <code>GATTACA</code>. If we convert it to an integer using the table above, we get <span>$2033010_4 = 10001111000100_2 = 9156_{10}$</span>, so the integer value of <code>GATTACA</code> is 9156. Since Julia uses 1-based indexing, we would add 1 to this value to get the index for the value in a vector associated with <code>GATTACA</code>.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Home</a><a class="docs-footer-nextpage" href="../references/">API Reference »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.3.0 on <span class="colophon-date" title="Sunday 24 March 2024 16:55">Sunday 24 March 2024</span>. Using Julia version 1.10.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/objects.inv b/dev/objects.inv
index e5e1bec015dde105345aa920d6086503357a3fba..1c462bdc0bb10b14eb9b42a29a055c7890d283ed 100644
GIT binary patch
delta 414
zcmV;P0b%~`1DOPng@3J7O;5r=5WV+TjM3g$`8tpgE)`8AHN=q0m6UdnN<YYUi6H5J
zcXqeb77!wG+TAyA-pssd8pJ-XAr*etPds2vfafGaV-X>r1hJTKT=H_b5~#x<7Wj@F
z<O@@bg)4$2hGYf{M$FXUI75!HhiO^m2d#tkI)KscZ9J{rXnz^z?9iQJDEYT{zzia?
zf>@20^oUwj9HS=^9(<v1%Z`o+=R>gS5G<K7n;oW99Uy_<J<#dKOArMbJXU?Lzk4rP
zk+`gqvbUI$w$6$i+1gnR^lez&tawv{u^TE#o_2OBMxo6Tx4rHN-S7|08W1JR;B!u%
z!gg2#JA;p~8-JmgCq7jDKRb`vc~Eufw9U?=+3K6E#ts=CAXHid)LDg~-%8?=9GyT@
z5@5FbX4{gpUSCZ`q(rXVP%hG`q*PQYoc<DS&VC9q#p2xEsPzy^ODi_!=tUCL>sr%P
zaMu%>?8seSZNX-%dC`uG=F6UFM(*E%bxz6UoT7lrco&LSSp2_j6g^5~1Z^7j|2xe1
I20~%5#IJzMfB*mh

delta 370
zcmV-&0ge8d1nvWng@1ig%WlFj5WMFrwrX?3<#j-cxTLgFQL3myv{w|8O-V^?<QNqi
z<=?x`iv%bq+p{y{*|j4c-~mz<Sb^|?4FO(7g2pyMxr(@2N!-bHxl`!FoGbi73HpR7
za^<N=a1bGE88K5sVu%vCk7?WGg!aLHpTnX)8!rbh*~Jw*_J7tGYySNMFoTG!A$IF6
zKcaRWKhal0M!y(1I?)y5stML@f{Bnh>{8O502%y0gHmq1MM+eG*KR+{_vni~nd@89
zJNCgzl`Z!)lFZ3_Q10AjrzN(O-Z67~;1JpaJBObQ^$yu`YAw)<37T5=I#ts1RY@kA
zlaq<fk9kAa4r@7l$5w)BBLe7-^+rr{G%SOflcypUt&TZ)Hb*maG^i2x0%^h*)uQB7
zjB>FH$)LzbtF|}CGjnX~wrWZNblNL;l^q$>*HW|fv2E#|sJCB3k0#u6zj)pAKX<8Y
Qy3;y!+~F0?9}LhYmJd&~ga7~l

diff --git a/dev/references/index.html b/dev/references/index.html
index fcae7b2..c6af41f 100644
--- a/dev/references/index.html
+++ b/dev/references/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API Reference · VectorizedKmers.jl</title><meta name="title" content="API Reference · VectorizedKmers.jl"/><meta property="og:title" content="API Reference · VectorizedKmers.jl"/><meta property="twitter:title" content="API Reference · VectorizedKmers.jl"/><meta name="description" content="Documentation for VectorizedKmers.jl."/><meta property="og:description" content="Documentation for VectorizedKmers.jl."/><meta property="twitter:description" content="Documentation for VectorizedKmers.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.png" alt="VectorizedKmers.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">VectorizedKmers.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li><a class="tocitem" href="../kmer_int_repr/">Integer representation of k-mers</a></li><li class="is-active"><a class="tocitem" href>API Reference</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API Reference</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API Reference</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl/blob/main/docs/src/references.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="API-Reference"><a class="docs-heading-anchor" href="#API-Reference">API Reference</a><a id="API-Reference-1"></a><a class="docs-heading-anchor-permalink" href="#API-Reference" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="VectorizedKmers.KmerArray" href="#VectorizedKmers.KmerArray"><code>VectorizedKmers.KmerArray</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">KmerArray{N, K, T &lt;: Real, A &lt;: AbstractArray{T, K}} &lt;: StaticArray{NTuple{K, N}, T, K}</code></pre><ul><li><code>N</code> is the alphabet size</li><li><code>K</code> is the K-mer size</li><li><code>T</code> is the element type</li><li><code>A</code> is the array type</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/anton083/VectorizedKmers.jl/blob/a00277a926ba24d8e6c196f3789f53cf4026b634/src/KmerArray.jl#L5-L12">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="VectorizedKmers.count_kmers" href="#VectorizedKmers.count_kmers"><code>VectorizedKmers.count_kmers</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">count_kmers(sequence, K, T=Int, zeros=zeros; N=default_alphabet_size(eltype(sequence)))</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/anton083/VectorizedKmers.jl/blob/a00277a926ba24d8e6c196f3789f53cf4026b634/src/count.jl#L18-L20">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="VectorizedKmers.count_kmers!-Union{Tuple{K}, Tuple{N}, Tuple{KmerArray{N, K, T, A} where {T&lt;:Real, A&lt;:AbstractArray{T, K}}, Any}} where {N, K}" href="#VectorizedKmers.count_kmers!-Union{Tuple{K}, Tuple{N}, Tuple{KmerArray{N, K, T, A} where {T&lt;:Real, A&lt;:AbstractArray{T, K}}, Any}} where {N, K}"><code>VectorizedKmers.count_kmers!</code></a> — <span class="docstring-category">Method</span></header><section><div><pre><code class="language-julia hljs">count_kmers!(kmer_array, sequence; reset=true)</code></pre><p>Requires method <code>axis_index(::KmerArray{N}, ::eltype(sequence)) where N</code> to be defined</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/anton083/VectorizedKmers.jl/blob/a00277a926ba24d8e6c196f3789f53cf4026b634/src/count.jl#L1-L5">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../kmer_int_repr/">« Integer representation of k-mers</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.3.0 on <span class="colophon-date" title="Sunday 24 March 2024 02:13">Sunday 24 March 2024</span>. Using Julia version 1.10.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API Reference · VectorizedKmers.jl</title><meta name="title" content="API Reference · VectorizedKmers.jl"/><meta property="og:title" content="API Reference · VectorizedKmers.jl"/><meta property="twitter:title" content="API Reference · VectorizedKmers.jl"/><meta name="description" content="Documentation for VectorizedKmers.jl."/><meta property="og:description" content="Documentation for VectorizedKmers.jl."/><meta property="twitter:description" content="Documentation for VectorizedKmers.jl."/><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.050/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/6.4.2/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.16.8/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../search_index.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.png" alt="VectorizedKmers.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">VectorizedKmers.jl</a></span></div><button class="docs-search-query input is-rounded is-small is-clickable my-2 mx-auto py-1 px-2" id="documenter-search-query">Search docs (Ctrl + /)</button><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li><a class="tocitem" href="../kmer_int_repr/">Integer representation of k-mers</a></li><li class="is-active"><a class="tocitem" href>API Reference</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><a class="docs-sidebar-button docs-navbar-link fa-solid fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API Reference</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API Reference</a></li></ul></nav><div class="docs-right"><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl" title="View the repository on GitHub"><span class="docs-icon fa-brands"></span><span class="docs-label is-hidden-touch">GitHub</span></a><a class="docs-navbar-link" href="https://github.com/anton083/VectorizedKmers.jl/blob/main/docs/src/references.md" title="Edit source on GitHub"><span class="docs-icon fa-solid"></span></a><a class="docs-settings-button docs-navbar-link fa-solid fa-gear" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-article-toggle-button fa-solid fa-chevron-up" id="documenter-article-toggle-button" href="javascript:;" title="Collapse all docstrings"></a></div></header><article class="content" id="documenter-page"><h1 id="API-Reference"><a class="docs-heading-anchor" href="#API-Reference">API Reference</a><a id="API-Reference-1"></a><a class="docs-heading-anchor-permalink" href="#API-Reference" title="Permalink"></a></h1><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="VectorizedKmers.KmerArray" href="#VectorizedKmers.KmerArray"><code>VectorizedKmers.KmerArray</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">KmerArray{N, K, T &lt;: Real, A &lt;: AbstractArray{T, K}} &lt;: StaticArray{NTuple{K, N}, T, K}</code></pre><ul><li><code>N</code> is the alphabet size</li><li><code>K</code> is the K-mer size</li><li><code>T</code> is the element type</li><li><code>A</code> is the array type</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/anton083/VectorizedKmers.jl/blob/5bb8db93f411ea67ba3e0c1e52a3ce5c675cd3aa/src/KmerArray.jl#L5-L12">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="VectorizedKmers.count_kmers!-Union{Tuple{K}, Tuple{N}, Tuple{KmerArray{N, K, T, A} where {T&lt;:Real, A&lt;:AbstractArray{T, K}}, Any}} where {N, K}" href="#VectorizedKmers.count_kmers!-Union{Tuple{K}, Tuple{N}, Tuple{KmerArray{N, K, T, A} where {T&lt;:Real, A&lt;:AbstractArray{T, K}}, Any}} where {N, K}"><code>VectorizedKmers.count_kmers!</code></a> — <span class="docstring-category">Method</span></header><section><div><pre><code class="language-julia hljs">count_kmers!(kmer_array, sequence; reset=true)</code></pre><p>Requires method <code>axis_index(::KmerArray{N}, ::eltype(sequence)) where N</code> to be defined</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/anton083/VectorizedKmers.jl/blob/5bb8db93f411ea67ba3e0c1e52a3ce5c675cd3aa/src/count.jl#L1-L5">source</a></section></article><article class="docstring"><header><a class="docstring-article-toggle-button fa-solid fa-chevron-down" href="javascript:;" title="Collapse docstring"></a><a class="docstring-binding" id="VectorizedKmers.count_kmers-Union{Tuple{K}, Tuple{N}, Tuple{Any, Val{N}, Val{K}}, Tuple{Any, Val{N}, Val{K}, Type{&lt;:Real}}, Tuple{Any, Val{N}, Val{K}, Type{&lt;:Real}, Any}} where {N, K}" href="#VectorizedKmers.count_kmers-Union{Tuple{K}, Tuple{N}, Tuple{Any, Val{N}, Val{K}}, Tuple{Any, Val{N}, Val{K}, Type{&lt;:Real}}, Tuple{Any, Val{N}, Val{K}, Type{&lt;:Real}, Any}} where {N, K}"><code>VectorizedKmers.count_kmers</code></a> — <span class="docstring-category">Method</span></header><section><div><pre><code class="language-julia hljs">count_kmers(sequence, [N,] K, T=Int, zeros=zeros)</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/anton083/VectorizedKmers.jl/blob/5bb8db93f411ea67ba3e0c1e52a3ce5c675cd3aa/src/count.jl#L18-L20">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../kmer_int_repr/">« Integer representation of k-mers</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="auto">Automatic (OS)</option><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.3.0 on <span class="colophon-date" title="Sunday 24 March 2024 16:55">Sunday 24 March 2024</span>. Using Julia version 1.10.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/search_index.js b/dev/search_index.js
index 9b0f9d6..c4c5d33 100644
--- a/dev/search_index.js
+++ b/dev/search_index.js
@@ -1,3 +1,3 @@
 var documenterSearchIndex = {"docs":
-[{"location":"references/#API-Reference","page":"API Reference","title":"API Reference","text":"","category":"section"},{"location":"references/","page":"API Reference","title":"API Reference","text":"Modules = [VectorizedKmers]","category":"page"},{"location":"references/#VectorizedKmers.KmerArray","page":"API Reference","title":"VectorizedKmers.KmerArray","text":"KmerArray{N, K, T <: Real, A <: AbstractArray{T, K}} <: StaticArray{NTuple{K, N}, T, K}\n\nN is the alphabet size\nK is the K-mer size\nT is the element type\nA is the array type\n\n\n\n\n\n","category":"type"},{"location":"references/#VectorizedKmers.count_kmers","page":"API Reference","title":"VectorizedKmers.count_kmers","text":"count_kmers(sequence, K, T=Int, zeros=zeros; N=default_alphabet_size(eltype(sequence)))\n\n\n\n\n\n","category":"function"},{"location":"references/#VectorizedKmers.count_kmers!-Union{Tuple{K}, Tuple{N}, Tuple{KmerArray{N, K, T, A} where {T<:Real, A<:AbstractArray{T, K}}, Any}} where {N, K}","page":"API Reference","title":"VectorizedKmers.count_kmers!","text":"count_kmers!(kmer_array, sequence; reset=true)\n\nRequires method axis_index(::KmerArray{N}, ::eltype(sequence)) where N to be defined\n\n\n\n\n\n","category":"method"},{"location":"kmer_int_repr/#Integer-representation-of-K-mers","page":"Integer representation of k-mers","title":"Integer representation of K-mers","text":"","category":"section"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"This package relies on representing K-mers as integers for indexing.","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"For DNA, each non-ambiguous nucleotide is assigned a number between 0 and 3:","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"Nucleotide Base-4 Base-2\nA 0 00\nC 1 01\nG 2 10\nT 3 11","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"Any ordering works, but this is the one used by BioSequences.jl. It also has some nice properties, like being in alphabetical order, and that XOR-ing a base with 3 gives you its complement.","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"We could theoretically convert any DNA sequence to an integer, but 64-bit unsigned integers limit us to 32-mers.","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"Consider the DNA sequence GATTACA. If we convert it to an integer using the table above, we get 2033010_4 = 10001111000100_2 = 9156_10, so the integer value of GATTACA is 9156. Since Julia uses 1-based indexing, we would add 1 to this value to get the index for the value in a vector associated with GATTACA.","category":"page"},{"location":"","page":"Home","title":"Home","text":"CurrentModule = VectorizedKmers\nDocTestSetup = quote\n    using VectorizedKmers\nend","category":"page"},{"location":"#VectorizedKmers","page":"Home","title":"VectorizedKmers","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"(Image: Latest Release) (Image: MIT license) (Image: Documentation) (Image: Documentation) (Image: Status) (Image: Coverage)","category":"page"},{"location":"","page":"Home","title":"Home","text":"VectorizedKmers.jl is a Julia package primarily designed for fast K-mer counting of biological sequences. The core idea is that K-mers with an alphabet size of N are essentially integers in base N, and can be used as indices in a vector of size N^K to count the corresponding K-mers.","category":"page"},{"location":"","page":"Home","title":"Home","text":"This data structure can be used to quickly approximate distances between sequences. Notably, the squared Euclidean distance was used to approximate edit distance in this paper. The dot product has also proven to be a useful metric for comparing correlation between sequences.","category":"page"},{"location":"#Examples","page":"Home","title":"Examples","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"julia> using VectorizedKmers, BioSequences\n\njulia> kmer_array = count_kmers(dna\"ACCGGGTTTT\", 1)\nKmerArray{4, 1, Int64, Vector{Int64}} with size (4,)\n\njulia> kmer_array |> values\n4-element Vector{Int64}:\n 1\n 2\n 3\n 4\n\njulia> count_kmers(dna\"AATT\", 2) |> values # 2-mers of AATT\n4×4 Matrix{Int64}:\n 1  0  0  0\n 0  0  0  0\n 0  0  0  0\n 1  0  0  1\n\njulia> count_kmers(aa\"AY\", 1) |> values\n20-element Vector{Int64}:\n 1\n 0\n 0\n ⋮\n 0\n 1\n 0","category":"page"},{"location":"#Limitations","page":"Home","title":"Limitations","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"The main downside of counting K-mers this way is that the arrays grow exponentially with respect to K. The 31-mer array of a DNA sequence would have a length of 4^31 = 4611686018427387904, which is equivalent to four exbibytes of memory, if the values are stored with 8-bit integers — which is just not feasible, really. Not only does allocating a lot of memory take up a lot of memory, but it can also take a substantial amount of time! This method of counting K-mers therefore works best for lower K-values.","category":"page"}]
+[{"location":"references/#API-Reference","page":"API Reference","title":"API Reference","text":"","category":"section"},{"location":"references/","page":"API Reference","title":"API Reference","text":"Modules = [VectorizedKmers]","category":"page"},{"location":"references/#VectorizedKmers.KmerArray","page":"API Reference","title":"VectorizedKmers.KmerArray","text":"KmerArray{N, K, T <: Real, A <: AbstractArray{T, K}} <: StaticArray{NTuple{K, N}, T, K}\n\nN is the alphabet size\nK is the K-mer size\nT is the element type\nA is the array type\n\n\n\n\n\n","category":"type"},{"location":"references/#VectorizedKmers.count_kmers!-Union{Tuple{K}, Tuple{N}, Tuple{KmerArray{N, K, T, A} where {T<:Real, A<:AbstractArray{T, K}}, Any}} where {N, K}","page":"API Reference","title":"VectorizedKmers.count_kmers!","text":"count_kmers!(kmer_array, sequence; reset=true)\n\nRequires method axis_index(::KmerArray{N}, ::eltype(sequence)) where N to be defined\n\n\n\n\n\n","category":"method"},{"location":"references/#VectorizedKmers.count_kmers-Union{Tuple{K}, Tuple{N}, Tuple{Any, Val{N}, Val{K}}, Tuple{Any, Val{N}, Val{K}, Type{<:Real}}, Tuple{Any, Val{N}, Val{K}, Type{<:Real}, Any}} where {N, K}","page":"API Reference","title":"VectorizedKmers.count_kmers","text":"count_kmers(sequence, [N,] K, T=Int, zeros=zeros)\n\n\n\n\n\n","category":"method"},{"location":"kmer_int_repr/#Integer-representation-of-K-mers","page":"Integer representation of k-mers","title":"Integer representation of K-mers","text":"","category":"section"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"This package relies on representing K-mers as integers for indexing.","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"For DNA, each non-ambiguous nucleotide is assigned a number between 0 and 3:","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"Nucleotide Base-4 Base-2\nA 0 00\nC 1 01\nG 2 10\nT 3 11","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"Any ordering works, but this is the one used by BioSequences.jl. It also has some nice properties, like being in alphabetical order, and that XOR-ing a base with 3 gives you its complement.","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"We could theoretically convert any DNA sequence to an integer, but 64-bit unsigned integers limit us to 32-mers.","category":"page"},{"location":"kmer_int_repr/","page":"Integer representation of k-mers","title":"Integer representation of k-mers","text":"Consider the DNA sequence GATTACA. If we convert it to an integer using the table above, we get 2033010_4 = 10001111000100_2 = 9156_10, so the integer value of GATTACA is 9156. Since Julia uses 1-based indexing, we would add 1 to this value to get the index for the value in a vector associated with GATTACA.","category":"page"},{"location":"","page":"Home","title":"Home","text":"CurrentModule = VectorizedKmers\nDocTestSetup = quote\n    using VectorizedKmers\nend","category":"page"},{"location":"#VectorizedKmers","page":"Home","title":"VectorizedKmers","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"(Image: Latest Release) (Image: MIT license) (Image: Documentation) (Image: Documentation) (Image: Status) (Image: Coverage)","category":"page"},{"location":"","page":"Home","title":"Home","text":"VectorizedKmers.jl is a Julia package primarily designed for fast K-mer counting of biological sequences. The core idea is that K-mers with an alphabet size of N are essentially integers in base N, and can be used as indices in a vector of size N^K to count the corresponding K-mers.","category":"page"},{"location":"","page":"Home","title":"Home","text":"This data structure can be used to quickly approximate distances between sequences. Notably, the squared Euclidean distance was used to approximate edit distance in this paper. The dot product has also proven to be a useful metric for comparing correlation between sequences.","category":"page"},{"location":"#Examples","page":"Home","title":"Examples","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"julia> using VectorizedKmers, BioSequences\n\njulia> kmer_array = count_kmers(dna\"ACCGGGTTTT\", 1)\nKmerArray{4, 1, Int64, Vector{Int64}} with size (4,)\n\njulia> kmer_array |> values\n4-element Vector{Int64}:\n 1\n 2\n 3\n 4\n\njulia> count_kmers(dna\"AATT\", 2) |> values # 2-mers of AATT\n4×4 Matrix{Int64}:\n 1  0  0  0\n 0  0  0  0\n 0  0  0  0\n 1  0  0  1\n\njulia> count_kmers(aa\"AY\", 1) |> values\n20-element Vector{Int64}:\n 1\n 0\n 0\n ⋮\n 0\n 1\n 0","category":"page"},{"location":"#Limitations","page":"Home","title":"Limitations","text":"","category":"section"},{"location":"","page":"Home","title":"Home","text":"The main downside of counting K-mers this way is that the arrays grow exponentially with respect to K. The 31-mer array of a DNA sequence would have a length of 4^31 = 4611686018427387904, which is equivalent to four exbibytes of memory, if the values are stored with 8-bit integers — which is just not feasible, really. Not only does allocating a lot of memory take up a lot of memory, but it can also take a substantial amount of time! This method of counting K-mers therefore works best for lower K-values.","category":"page"}]
 }