**Graph-based clustering using a decoupling strategy for hard-clustering**

*The principle is similar for soft-clustering. Gray nodes represent TF nodes. The T-label problem is decomposed into T binary sub-problems by setting component t of the marker labels s(t), t ∈ T, to one and the others to zero. Each sub-problem t yields a probability for each node. The final clustering assigns each node the label whose probability across the T sub-problems is maximal.*
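The decoupling described in the caption can be illustrated with a small sketch. It is not the BRANE Clust code: the per-label solver used here (a simple harmonic label propagation in which each unlabeled node takes the weighted average of its neighbors) is an assumed stand-in, and the function name, the toy graph, and the marker assignment are all hypothetical. Only the overall structure follows the caption: one binary sub-problem per label, then a per-node argmax.

```python
def decoupled_hard_clustering(n_nodes, edges, markers, n_labels, n_iter=500):
    """Hypothetical helper: one binary sub-problem per label, then argmax.

    edges   : list of (i, j, weight) for an undirected weighted graph
    markers : dict {node: label} giving the nodes whose label is known
    """
    neighbors = [[] for _ in range(n_nodes)]
    for i, j, w in edges:
        neighbors[i].append((j, w))
        neighbors[j].append((i, w))

    probabilities = []
    for t in range(n_labels):
        # Marker vector for sub-problem t: 1 on markers of label t, 0 on the others.
        p = [0.0] * n_nodes
        for node, label in markers.items():
            p[node] = 1.0 if label == t else 0.0

        # Assumed stand-in solver: repeated weighted averaging over neighbors,
        # keeping marker nodes fixed (not necessarily the authors' solver).
        for _ in range(n_iter):
            new_p = p[:]
            for v in range(n_nodes):
                if v in markers:
                    continue
                total = sum(w for _, w in neighbors[v])
                if total > 0:
                    new_p[v] = sum(w * p[u] for u, w in neighbors[v]) / total
            p = new_p
        probabilities.append(p)

    # Hard clustering: each node takes the label with maximal probability.
    labels = [
        max(range(n_labels), key=lambda t: probabilities[t][v])
        for v in range(n_nodes)
    ]
    return labels, probabilities


# Toy path graph 0-1-2-3-4 with one marker at each end (illustrative values).
toy_edges = [(0, 1, 5.0), (1, 2, 1.0), (2, 3, 5.0), (3, 4, 5.0)]
toy_markers = {0: 0, 4: 1}
labels, probabilities = decoupled_hard_clustering(5, toy_edges, toy_markers, 2)
```

In this toy graph the weak edge between nodes 1 and 2 is where the two clusters separate. For soft-clustering, one would keep the per-label probabilities instead of taking the argmax.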

**Availability** – BRANE Clust software is available at: http://www-syscom.univ-mlv.fr/~pirayre/Codes-GRN-BRANE-clust.html

Pirayre A, Couprie C, Duval L, Pesquet JC. (2017) **BRANE Clust: Cluster-Assisted Gene Regulatory Network Inference Refinement.** *IEEE/ACM Trans Comput Biol Bioinform* [Epub ahead of print]. [abstract]

Weighting all possible pairwise gene relationships by a probability of edge presence, researchers from IFP Energies Nouvelles, France, formulate regulatory network inference as a discrete variational problem on graphs. They enforce biologically plausible coupling between groups and types of genes by minimizing an edge labeling functional that encodes a priori structures. The optimization is carried out with graph cuts, an approach popular in image processing and computer vision. The researchers compare the inferred regulatory networks to results achieved by the mutual-information-based Context Likelihood of Relatedness (CLR) method and by the state-of-the-art GENIE3, winner of the DREAM4 multifactorial challenge.

*Schematic view of the proposed BRANE Cut method. The initial graph (a) is transformed into an intermediate graph (b) in which a max-flow computation is performed to return an optimal edge labeling x^{∗}, leading to the inferred graph (c). We choose to present the method in its full generality with unscaled weights (i.e. w_{i,j} ∈ [0,+∞[, and the λ parameters also belong to [0,+∞[). Nodes v_{2} and v_{3} are TFs, λ_{\overline{TF}}=1 and λ_{TF}=3. Taking γ=4 implies that v_{1}, v_{2}, and v_{3} satisfy the regulator coupling property. Vertices v_{1} and v_{4} are thus affected, leading to the presence of additional edges weighted by ρ_{1,2,3}=0 and ρ_{4,2,3}=3, when μ is set to 3. Computing a max-flow in the graph (b) leads to some edge saturation, represented in dashed lines. The values from the source (value 1) and the sink (value 0) are propagated through non-saturated paths, thus leading to x_{2,4}=x_{3,4}=0.*
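The last two sentences of the caption describe a generic property of max-flow/min-cut labeling: once the flow is maximal, nodes still reachable from the source through non-saturated arcs receive value 1 and the rest receive value 0. The sketch below shows that mechanism with a plain Edmonds-Karp max-flow on a hypothetical four-node flow network. It does not reproduce the BRANE Cut intermediate graph, its λ, γ, μ, or ρ terms, or the numbers in the figure; the function name and capacities are illustrative assumptions.

```python
from collections import deque


def max_flow_labels(n, capacity_edges, source, sink):
    """Edmonds-Karp max-flow; returns (flow value, 0/1 label per node).

    capacity_edges : list of (u, v, capacity) directed arcs
    Nodes reachable from the source in the final residual graph get label 1.
    """
    residual = [[0.0] * n for _ in range(n)]
    for u, v, c in capacity_edges:
        residual[u][v] += c

    flow = 0.0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = [-1] * n
        parent[source] = source
        queue = deque([source])
        while queue and parent[sink] == -1:
            u = queue.popleft()
            for v in range(n):
                if parent[v] == -1 and residual[u][v] > 0:
                    parent[v] = u
                    queue.append(v)
        if parent[sink] == -1:
            break  # no augmenting path left: the flow is maximal

        # Find the bottleneck capacity along the path, then push flow.
        bottleneck = float("inf")
        v = sink
        while v != source:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        v = sink
        while v != source:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck

    # The final (failed) search marks exactly the nodes on the source side.
    labels = [1 if parent[v] != -1 else 0 for v in range(n)]
    return flow, labels


# Hypothetical network: node 0 is the source, node 3 is the sink.
arcs = [(0, 1, 3), (0, 2, 1), (1, 3, 1), (2, 3, 3), (1, 2, 1)]
flow_value, node_labels = max_flow_labels(4, arcs, source=0, sink=3)
```

In BRANE Cut the nodes of the intermediate graph stand for candidate edges of the gene network, so a labeling of this kind is read as an edge selection; that mapping is left out here.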

The BRANE Cut approach infers the five DREAM4 in silico networks more accurately (with improvements from 6% to 11%). On a real Escherichia coli compendium, an improvement of 11.8% compared to CLR and 3% compared to GENIE3 is obtained in terms of Area Under the Precision-Recall curve. Up to 48 additional verified interactions are obtained over GENIE3 for a given precision. On this dataset involving 4345 genes, the method achieves a performance similar to that of GENIE3, while being more than seven times faster.
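For readers unfamiliar with the metric, the sketch below shows one common way to summarize a precision-recall curve for a ranked list of predicted edges: average precision, the mean of the precision values at each rank where a true edge is recovered. This is an assumption about the convention; the paper's exact Area Under Precision-Recall computation may differ (for example, trapezoidal integration), and the gene names and scores are made up for illustration.

```python
def average_precision(scored_edges, true_edges):
    """Average precision of a ranked edge list (ties are not handled specially).

    scored_edges : list of (edge, score), higher score = more confident
    true_edges   : set of edges considered verified
    """
    ranked = sorted(scored_edges, key=lambda item: item[1], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (edge, _score) in enumerate(ranked, start=1):
        if edge in true_edges:
            hits += 1
            precision_sum += hits / rank  # precision at this recall level
    # True edges missing from the ranking count as never recovered.
    return precision_sum / len(true_edges) if true_edges else 0.0


# Illustrative data only: four scored candidate edges, three of them true.
predictions = [
    (("g1", "g2"), 0.9),
    (("g1", "g3"), 0.8),
    (("g2", "g3"), 0.4),
    (("g3", "g4"), 0.2),
]
verified = {("g1", "g2"), ("g2", "g3"), ("g3", "g4")}
ap = average_precision(predictions, verified)
```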

BRANE Cut is a weighted graph thresholding method. Using biologically sound penalties and data-driven parameters, it improves three state-of-the-art GRN inference methods. Thanks to its computational efficiency, it is applicable as a generic post-processing step for network inference.

**Availability** – The BRANE Cut code is available at: http://www-syscom.univ-mlv.fr/~pirayre/Codes-GRN-BRANE-cut.html

Pirayre A, Couprie C, Bidard F, Duval L, Pesquet JC. (2015) **BRANE Cut: biologically-related a priori network enhancement with graph cuts for gene regulatory network inference**. *BMC Bioinformatics* 16:369. [article]

Here researchers from the Broad Institute of MIT and Harvard present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. They applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. They confirmed Seurat’s accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.

**Availability** – Seurat is available as an open-source software package in R. The full code, visual tutorials, and more can be accessed at www.satijalab.org/seurat.

Satija R, Farrell JA, Gennert D, Schier AF, Regev A. (2015) **Spatial reconstruction of single-cell gene expression data.** *Nat Biotechnol* [Epub ahead of print]. [abstract]