%\VignetteIndexEntry{keggorth overview} %\VignetteDepends{hgu95av2.db, graph, RBGL, ALL} %\VignetteKeywords{Annotation, Pathways} %\VignettePackage{keggorth} % % NOTE -- ONLY EDIT THE .Rnw FILE!!! The .tex file is % likely to be overwritten. % \documentclass[12pt]{article} \usepackage{amsmath} \usepackage[authoryear,round]{natbib} \usepackage{hyperref} \textwidth=6.2in \textheight=8.5in %\parskip=.3cm \oddsidemargin=.1in \evensidemargin=.1in \headheight=-.3in \newcommand{\scscst}{\scriptscriptstyle} \newcommand{\scst}{\scriptstyle} \newcommand{\Rfunction}[1]{{\texttt{#1}}} \newcommand{\Robject}[1]{{\texttt{#1}}} \newcommand{\Rpackage}[1]{{\textit{#1}}} \newcommand{\Rmethod}[1]{{\texttt{#1}}} \newcommand{\Rfunarg}[1]{{\texttt{#1}}} \newcommand{\Rclass}[1]{{\textit{#1}}} \textwidth=6.2in \bibliographystyle{plainnat} \begin{document} %\setkeys{Gin}{width=0.55\textwidth} \title{\Rpackage{keggorth}: the KEGG orthology as \Robject{graph}} \author{VJ Carey}% \verb++} \maketitle \tableofcontents \section{Introduction} KEGG is the Kyoto Encyclopedia of Genes and Genomes. An important product of the KEGG group is a catalog of pathways. The KEGG Orthology (KO) organizes the pathways into a conceptual hierarchy. This package encodes the hierarchy as a graph, and provides some support for deriving sets of array feature identifiers from the hierarchy. \section{\Robject{KOgraph}} <>= library(keggorth) data(KOgraph) KOgraph nodes(KOgraph)[1:5] @ The upper component of the hierarchy is: <>= adj(KOgraph, nodes(KOgraph)[1]) @ Graph operations can be used to explore the orthology. For example, the context of the PPAR signaling pathway is found as follows: <>= library(RBGL) sp.between(KOgraph, nodes(KOgraph)[1], "PPAR signaling pathway") @ Fixed-length identifiers are used to label pathways. These are available as the 'tag' nodeData attribute. <>= nodeData(KOgraph,,"tag")[1:5] @ The depth of each term is also available. <>= nodeData(KOgraph,,"depth")[1:5] @ \section{Application to gene filtering} Several functions are available for retrieving relevant information from the orthology. If you know a substring of the pathway name of interest, you can obtain the numerical tag(s). <>= getKOtags("insulin") @ We can get probe set identifiers corresponding to a term. The default chip annotation package used is hgu95av2.db. <>= library(hgu95av2.db) mp = getKOprobes("Methionine") library(ALL) data(ALL) ALL[mp,] @ \section{Infrastructure considerations} The graph was built through manual massaging of an HTML page in which the orthology was expanded to level 3. This operation needs to be programmatic, and the version number needs to be made clear. The HTML source from which this was built is in inst/keggHTML in the source package. \section{Session info} <>= sessionInfo() @ \end{document}