open_projects
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| open_projects [2024/01/24 08:22] – project | open_projects [2025/10/17 17:30] (current) – project | ||
|---|---|---|---|
| Line 4: | Line 4: | ||
| If you are a potential supervisor, [[supervisor_instructions: | If you are a potential supervisor, [[supervisor_instructions: | ||
| + | |||
| + | === The Barrier Atlas: Cross-Tissue Insights into Homeostasis and Dysfunction === | ||
| + | |||
| + | Contact: Amanda Oliver (Amanda.Oliver@qimrb.edu.au) | ||
| + | |||
| + | Single-cell RNA sequencing (scRNA-seq) has transformed our understanding of human tissue biology, revealing cellular diversity across organs and disease states. Building on existing datasets profiling millions of cells, this project aims to construct a unified single-cell atlas of barrier tissues, including the lung and gut, to uncover shared and tissue-specific mechanisms that maintain immune balance at the body’s environmental interfaces. The student will develop and apply computational pipelines for large-scale data integration, | ||
| + | |||
| + | Suitable for Masters, or PhD students. Strong bioinformatics skills using Python or R are essential; experience with single-cell or spatial transcriptomics and knowledge of immunology or barrier tissue biology is highly desirable. | ||
| + | |||
| + | === The Escape of Human Genomic Data into Public Repositories === | ||
| + | |||
| + | Contact: Michael Hall (michael.hall1@uq.edu.au) | ||
| + | |||
| + | Public sequencing repositories (e.g. SRA) are growing rapidly, but many studies involving human clinical samples may inadvertently include identifiable host DNA—even when ethics approvals explicitly prohibit this. This project investigates the extent and implications of such data leakage. | ||
| + | |||
| + | Objectives: | ||
| + | • Identify publicly available datasets from clinical pathogen/ | ||
| + | • Quantify residual human genomic content using a variety of approaches and references | ||
| + | • Benchmark human read detection approaches (e.g. host depletion vs k-mer-based methods) | ||
| + | • Assess potential identifiability using forensic markers (e.g. Illumina Infinium SNPs, CODIS loci) | ||
| + | • Explore the role of ethics language, technical variability, | ||
| + | |||
| + | Skills you’ll gain: | ||
| + | • Handling and processing large sequencing datasets | ||
| + | • Working knowledge of alignment and k-mer classification tools (e.g. minimap2, kraken) and human read detection pipelines | ||
| + | • Experience in reproducible bioinformatics analysis and privacy-aware genomic research | ||
| + | • Insight into the intersection of ethics, bioinformatics, | ||
| + | |||
| + | This project is ideal for students interested in clinical genomics, privacy, ethics, or data-driven policy impact. Familiarity with the command line is necessary. Knowledge of Python would be great, but not required—we can build those skills as you go! | ||
| + | |||
| + | === Decoding the relationships between DNA replication, | ||
| + | |||
| + | Contact: Dr Mathew Jones (mathew.jones@uq.edu.au) | ||
| + | |||
| + | The human genome is packaged into chromatin and assembled into 3D self-interacting chromatin domains that regulate gene expression and coordinate the process of DNA replication. Understanding the relationships between genome structure and function is one of the outstanding challenges in modern biology. Changes in the 3D structure of the genome can cause copying errors (genetic mutations) during DNA replication that results in diseases such as cancer and advanced aging. Decoding the relationships between the genomic landscape and cellular processes such as DNA replication has the potential to inform the development of novel treatments that can treat cancer and extend longevity. | ||
| + | |||
| + | In this project we are seeking talented and enthusiastic postgraduate students to tackle two fundamental questions: 1. How does the epigenome and the 3D organisation of the genome regulate DNA replication? | ||
| + | |||
| + | Bioinformatics and Computer Science students with skills in R, Python and C++ that are familiar with software suites for the comparison, manipulation and annotation of genomic features are encouraged to contact Dr Mathew Jones (mathew.jones@uq.edu.au) to learn more about the projects available. | ||
| + | |||
| + | === Pangenomes to predict bacterial transmission in healthcare settings === | ||
| + | |||
| + | Contacts: Leah Roberts l.roberts3@uq.edu.au, | ||
| + | |||
| + | Predicting whether two bacterial isolates are the same (and thereby inferring if transmission has occurred) has traditionally been performed by identifying and counting single nucleotide variants (SNVs). To do this, a reference genome is usually selected, and isolate reads are mapped to the reference to identify SNVs in regions shared between all isolates. However, for large datasets of very diverse bacterial strains, a single reference genome is usually insufficient, | ||
| + | |||
| + | We propose a novel method using pangenome reference graphs to better identify and discriminate transmission of bacterial pathogens. This project would start to build test datasets and develop novel workflows for predicting transmission from pangenome graphs. | ||
| + | |||
| + | This project is suitable for an honours, Masters, or PhD student. Background in command line, HPC and python is highly desirable. This project will be based at UQCCR (Herston Campus) and co-supervised by Dr Michael Hall (University of Melbourne). | ||
| + | |||
| + | |||
| + | === Investigation of the effect of the circadian rhythm on the genetic control of gene expression === | ||
| + | |||
| + | Contact: Sonia shah < | ||
| + | |||
| + | The circadian rhythm reflects the daily cycle of behaviours and metabolic processes organisms exhibit. A 24-hour gene expression pattern occurs at the molecular level, with genes activated either during the day or night. Different tissues all display circadian control, with some more affected than others. Within the liver, for example, 3000 genes are subjected to circadian control. This regulation is orchestrated by a small group of CLOCK genes, establishing feedback loops that result in rhythmic gene expression in every tissue. | ||
| + | |||
| + | We know that gene expression can be influences by genetics variants, called expression quantitative trait loci (eQTL), and this may be one mechanism linking genetic variants to disease. As a result, large eQTL datasets have been generated to assist in understanding disease mechanisms. However, it remains unknown whether sample collection time can affect eQTL identification. This project therefore aims to identify the possible effects of the circadian rhythm on the genetic control of gene expression using the Genotype-Tissue expression (GTEx) dataset. | ||
| + | |||
| + | During this project, you will run Python tools such as PEER and tensorQTL to identify eQTL within 49 tissues. You will subsequently investigate the associations identified and follow up on the role of the genes under circadian controls within different phenotypes. | ||
| + | |||
| + | === Understanding the influence of taste and olfactory perception on eating behaviour and health conditions using big genetic data === | ||
| + | |||
| + | Contact info: Daniel Hwang < | ||
| + | |||
| + | Project description: | ||
| + | |||
| + | === Increasing drug success rate in human clinical trials using genomics === | ||
| + | |||
| + | Around 90% of drug candidates fail in human clinical trials largely due to lack of efficacy or safety concerns. This partly reflects the limitations of using in vitro and animal studies to predict the effect of compounds in humans. Recent studies highlight that drug targets backed by evidence from human genetic studies are 2 times more likely to make it to market. Human genetic data can also identify potential adverse side effects. Such information prior to embarking on human clinical trials could improve the success rate of a compound in human clinical trials and help avoid adverse outcomes for participants. This project will use statistical genomics analyses using publicly available human genomic data to predict efficacy as well as any safety concerns of compounds that are currently in the drug development pipeline. | ||
| + | |||
| + | Project significance: | ||
| + | |||
| + | Contact: Sonia Shah < | ||
| + | |||
| + | Supervisors: | ||
| + | |||
| + | Familiarity with computational analyses (e.g using R or python etc) is needed for this project. Some knowledge around genome-wide association studies and statistical genomics methods such as Mendelian randomisation analysis would be beneficial | ||
| + | |||
| + | === Developing quiescent stem cell classifier using single cell transcriptomics === | ||
| + | Contact info: Dr Lachlan Harris (Lachlan.Harris@qimrberghofer.edu.au), | ||
| + | |||
| + | Quiescence is a reversible state of cell-cycle arrest, sometimes referred to as the “G0” phase of the cell-cycle. It is an adaptive feature of most adult stem cell populations, | ||
| + | |||
| + | The most common classifiers that define cell-cycle stages (G1/S/G2/M) in single-cell RNA-sequencing data (scRNA- seq) were trained on populations of actively cycling cells. Therefore, these tools cannot identify quiescent stem cells in “G0” phase of the cell-cycle. It is an outstanding question as to whether there are sufficient transcriptomic similarities across quiescent stem cells from different tissue types to build a generalisable model to discriminate these cellular populations. Furthermore, | ||
| + | |||
| + | This project aims to develop a broadly applicable quiescent classifier. As a first step towards this, this project will seek to 1) contribute to the curation of datasets and isolation of tissue-agnostic and tissue-specific feature sets that define quiescent stem cells and 2) compare methods for training quiescent classifiers and for determining the most salient features. | ||
| + | |||
| + | |||
| + | === Understanding sex-specific cardiovascular disease risk === | ||
| + | |||
| + | Contact info: Dr Sonia Shah (sonia.shah@imb.uq.edu.au), | ||
| + | |||
| + | Description: | ||
| + | |||
| + | Requirements: | ||
| === De-risking the drug development pipeline by finding biomarkers of drug action === | === De-risking the drug development pipeline by finding biomarkers of drug action === | ||
| Line 123: | Line 219: | ||
| The project is embedded in the [[https:// | The project is embedded in the [[https:// | ||
| + | |||
| + | |||
| + | === Decoding Transcription Factor Dosage Effects on Cell State Transitions with DoseH-Seq === | ||
| + | |||
| + | Contact info: Dr Christian Nefzger (c.nefzger@imb.uq.edu.au), | ||
| + | |||
| + | Cell identity is controlled by different combinations of transcription factors (TFs) that bind to genomic regulatory elements to regulate gene expression. TF activity is not binary in most instances but graded and in response to TF dosage levels (e.g., Naqvi et al., Nat Genet., 2023, PMID: 37024583). For this reason, TFs are strongly enriched for haploinsufficient disease associations (Seidman et al, 2002, J. Clin. Invest. PMID: 11854316; Van de Lee et al., 2020, Trends Genet., PMID: 32451166) and TF dosage and stoichiometry strongly affects reprogramming outcomes (e.g., Polo et al, 2012, Cell, PMID: 32939092; An et al., 2019, Cell Reports, PMID: 31722212). Furthermore, | ||
| + | |||
| + | Single-cell RNA+ATAC-seq is a uniquely powerful assay to measure the impact of TF levels on cell regulatory architecture; | ||
| + | |||
| + | The applicant would help drive method establishment around our novel DoseH-seq technique and support analysis to understand TFs dosage effects with established data sets. Ideal candidate will be able to efficiently program in R or Python. This project is looking for bioinformatics Masters students (ideally 16 units, but we consider 8 unit applicants as well. We also consider PhD students. | ||
| + | |||
open_projects.1706044958.txt.gz · Last modified: 2024/01/24 08:22 by project