From nobody Tue Jul 11 01:44:37 2023 X-Original-To: dev-commits-ports-main@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4R0Nw52jbYz4gx7h; Tue, 11 Jul 2023 01:44:37 +0000 (UTC) (envelope-from git@FreeBSD.org) Received: from mxrelay.nyi.freebsd.org (mxrelay.nyi.freebsd.org [IPv6:2610:1c1:1:606c::19:3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (4096 bits) client-digest SHA256) (Client CN "mxrelay.nyi.freebsd.org", Issuer "R3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4R0Nw52Dhkz3CS0; Tue, 11 Jul 2023 01:44:37 +0000 (UTC) (envelope-from git@FreeBSD.org) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=freebsd.org; s=dkim; t=1689039877; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=EKQ4Pa/zjgVBtKBWO363SeoDb3gFsUmgwmZSl6/LCdg=; b=eQtBhWomrvvardCHVlKX8LwSpK1ly1BNLiimHBQIJZH1bQ1NwfNOZDzR2ZLn+uZYnVPwil XmuUVorvg9Ld39+pI3lm9qx3grZ+pEzWFXSF9h4mANamJ+OeEOgTvFht9gGb2m1eEearGq FFiq6cEW8UvcU0cZnmwWzm7Rx6F9FWMdfzbabWnXRmlAqU6LInjsLwza/P7UzWM9LnrjKz E5RVMRLiSyGFbaZd9IbaSRY0oSf0BJ7aTAlDWxrUCyiyytQff1W6qskEY1cw0WGv0oKt3a BWgKe3Nxoo7mepYSU9BpHOzUuRcdazj1VkzfPiazrjBEYsvfQjp8Br8/Op8jRA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=freebsd.org; s=dkim; t=1689039877; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=EKQ4Pa/zjgVBtKBWO363SeoDb3gFsUmgwmZSl6/LCdg=; b=umareRWYHTJF3wRn8S1C6cCcZK5NEUsz6vV3/Ty2abnukbefPavNU0QWQ9xqyEA89hodEi 5xPHSo0/UbtG2PTb85EamL78CWuVdqhPuqSgTUyX6F9ckKblait+8zMAqatKrjJDcBK7Y+ Muk1ThYDQu7o8y2XhLLbeIr8WJSrHNggdbD4uUREbfZ4VC10A5js4t9ntpEj1Phfd/lvhT KOCYgiCGk2s2AH4rcIOfIya4akulNhKbEipuHJ3JjP8dxwedV0JkZBrC2KdaspaCRMGW1m OKYrW5JsqMFxopD7x4lx+BDFy63VFTbmcXVWnRcmC10ki5QnPAFho3xiO0OTWA== ARC-Authentication-Results: i=1; mx1.freebsd.org; none ARC-Seal: i=1; s=dkim; d=freebsd.org; t=1689039877; a=rsa-sha256; cv=none; b=ReQ1+y1G94sRVllTohoRH1ZhO8EXESqV+/asecB31v7fgGqMZQZbEyRSofZ2IUfryBxJRQ EUeiZy9/j3To0jd4Rtt0yC8BKCmZIbJ6cb6GB6UXPVVm+nbmjeqO+OltJy1WACU6kM0dhU gT51rhXHUm0oZ5L7W1fXwC1g5NrB22VeMmRAgUz7z2kIGaU3jpCvJN4I2nrMNQ0/S4ovVJ oQwfEfupyTdahAnc+X8X5WPO9AGWjppgF1bL2dvJUiBklQIb8oHnAMmXOHdCS6T99TOTtX gvncm1PcZtQtGg1JRJA1I1aErnbivjta3q1qOF50Oa1Wlx4PA1rN3XWUG8AqHA== Received: from gitrepo.freebsd.org (gitrepo.freebsd.org [IPv6:2610:1c1:1:6068::e6a:5]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mxrelay.nyi.freebsd.org (Postfix) with ESMTPS id 4R0Nw516jVz16nV; Tue, 11 Jul 2023 01:44:37 +0000 (UTC) (envelope-from git@FreeBSD.org) Received: from gitrepo.freebsd.org ([127.0.1.44]) by gitrepo.freebsd.org (8.17.1/8.17.1) with ESMTP id 36B1ibXh080833; Tue, 11 Jul 2023 01:44:37 GMT (envelope-from git@gitrepo.freebsd.org) Received: (from git@localhost) by gitrepo.freebsd.org (8.17.1/8.17.1/Submit) id 36B1ibSc080832; Tue, 11 Jul 2023 01:44:37 GMT (envelope-from git) Date: Tue, 11 Jul 2023 01:44:37 GMT Message-Id: <202307110144.36B1ibSc080832@gitrepo.freebsd.org> To: ports-committers@FreeBSD.org, dev-commits-ports-all@FreeBSD.org, dev-commits-ports-main@FreeBSD.org From: Yuri Victorovich Subject: git: 251245030286 - main - biology/mashmap: New port: Fast approximate aligner for long DNA sequences List-Id: Commits to the main branch of the FreeBSD ports repository List-Archive: https://lists.freebsd.org/archives/dev-commits-ports-main List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-dev-commits-ports-main@freebsd.org X-BeenThere: dev-commits-ports-main@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Git-Committer: yuri X-Git-Repository: ports X-Git-Refname: refs/heads/main X-Git-Reftype: branch X-Git-Commit: 2512450302863b1bcfc8f3b3dc216b1ec64091c1 Auto-Submitted: auto-generated X-ThisMailContainsUnwantedMimeParts: N The branch main has been updated by yuri: URL: https://cgit.FreeBSD.org/ports/commit/?id=2512450302863b1bcfc8f3b3dc216b1ec64091c1 commit 2512450302863b1bcfc8f3b3dc216b1ec64091c1 Author: Yuri Victorovich AuthorDate: 2023-07-11 01:44:10 +0000 Commit: Yuri Victorovich CommitDate: 2023-07-11 01:44:34 +0000 biology/mashmap: New port: Fast approximate aligner for long DNA sequences --- biology/Makefile | 1 + biology/mashmap/Makefile | 24 ++++++++++++++++++++++++ biology/mashmap/distinfo | 3 +++ biology/mashmap/pkg-descr | 11 +++++++++++ 4 files changed, 39 insertions(+) diff --git a/biology/Makefile b/biology/Makefile index fe34cafefca7..71d625b98e9f 100644 --- a/biology/Makefile +++ b/biology/Makefile @@ -97,6 +97,7 @@ SUBDIR += linux-foldingathome SUBDIR += mafft SUBDIR += mapm3 + SUBDIR += mashmap SUBDIR += megahit SUBDIR += metaeuk SUBDIR += migrate diff --git a/biology/mashmap/Makefile b/biology/mashmap/Makefile new file mode 100644 index 000000000000..d9dc5f5fafa4 --- /dev/null +++ b/biology/mashmap/Makefile @@ -0,0 +1,24 @@ +PORTNAME= mashmap +DISTVERSIONPREFIX= v +DISTVERSION= 3.0.6 +CATEGORIES= biology + +MAINTAINER= yuri@FreeBSD.org +COMMENT= Fast approximate aligner for long DNA sequences +WWW= https://github.com/marbl/MashMap + +LICENSE= PD +LICENSE_FILE= ${WRKSRC}/LICENSE.txt + +LIB_DEPENDS= libgsl.so:math/gsl + +USES= cmake compiler:c++17-lang localbase:ldflags + +USE_GITHUB= yes +GH_ACCOUNT= marbl +GH_PROJECT= MashMap + +PLIST_FILES= bin/mashmap \ + bin/mashmap-align + +.include diff --git a/biology/mashmap/distinfo b/biology/mashmap/distinfo new file mode 100644 index 000000000000..7cbf1e62148e --- /dev/null +++ b/biology/mashmap/distinfo @@ -0,0 +1,3 @@ +TIMESTAMP = 1689039441 +SHA256 (marbl-MashMap-v3.0.6_GH0.tar.gz) = 53d1b4efad6650d8efbc28325637d1bdacd108eaad70fcc28e927f40ac5c2112 +SIZE (marbl-MashMap-v3.0.6_GH0.tar.gz) = 234093 diff --git a/biology/mashmap/pkg-descr b/biology/mashmap/pkg-descr new file mode 100644 index 000000000000..4460733166d2 --- /dev/null +++ b/biology/mashmap/pkg-descr @@ -0,0 +1,11 @@ +MashMap implements a fast and approximate algorithm for computing local +alignment boundaries between long DNA sequences. It can be useful for mapping +genome assembly or long reads (PacBio/ONT) to reference genome(s). Given a +minimum alignment length and an identity threshold for the desired local +alignments, Mashmap computes alignment boundaries and identity estimates using +k-mers. It does not compute the alignments explicitly, but rather estimates an +unbiased k-mer based Jaccard similarity using a combination of minmers (a novel +winnowing scheme) and MinHash. This is then converted to an estimate of sequence +identity using the Mash distance. An appropriate k-mer sampling rate is +automatically determined using the given minimum local alignment length and +identity thresholds.