# Can I use exome data for CNV detection?

CNV calling from exome FASTQ is done automatically! A prerequisite for this is [defining a Panel of Normals (PON) per each enrichment kit](/emedgene/emedgene-analyze-manual/settings/organization_settings_-330+/kit-management/panel-of-normals-pon/attaching-a-pon-to-a-coverage-bed-kit.md) you're using.

A PON aids to set a baseline coverage pattern and account for recurrent technical artifacts that are specific to your workflow. Depth of coverage per each sequenced region is averaged across PON samples; if a significant increase or decrease from this baseline is detected in a test sample, a CNV is called.

### Recommendations for creating a PON to call CNVs from exome data:

1. Samples for a PON should be derived from healthy individuals.
2. In our experience, a PON of at least 40-50 samples yields the best results. A smaller PON is better than nothing, but keep in mind that you may encounter more false positives.
3. You should aim at preparing samples for a PON in a unified manner to avoid the batch effect. Please log differences in library preparation (if any).


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://help.connected.illumina.com/emedgene/frequently-asked-questions/all-faq/can_i_use_exome_data_for_cnv_detection.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
