excludeDoubleMatchingProbes  package:GeneRegionScan  R Documentation

_E_x_c_l_u_d_e _P_r_o_b_e_L_e_v_e_l_S_e_t _p_r_o_b_e_s _t_h_a_t _m_a_t_c_h _m_o_r_e _t_h_a_n _o_n_c_e _i_n _g_e_n_o_m_e

_D_e_s_c_r_i_p_t_i_o_n:

     Function that will remove probes from ProbeLevelSet if they have
     more than one match in a given genome.

_U_s_a_g_e:

         excludeDoubleMatchingProbes(object, genome="BSgenome.Hsapiens.UCSC.hg18", verbose=TRUE,
         directions=c("matchForwardSense", "matchForwardAntisense", "matchReverseSense", "matchReverseAntisense"),
         previousData = NULL)

_A_r_g_u_m_e_n_t_s:

  object: A ProbeLevelSet class object.

  genome: character string with the name of the BSGenome in which
          sequences should be found. Defaults to the human genome.

 verbose: TRUE or FALSE.

directions: character string with elements from c("matchForwardSense",
          "matchForwardAntisense", "matchReverseSense",
          "matchReverseAntisense"). Defines which directions
          (complementary and reverse mirrorings) that should be
          scanned. Defaults to all directions.

previousData: Optional: The output from a call to findSequenceInGenome.
          If given the scanning will be skipped, and the probes will be
          omitted directly. Useful in cases were datasets from the same
          region needs to be processed.

_D_e_t_a_i_l_s:

     This function will take quite a while to run, so if you have many
     sequences, overnight runs are recommended.  BSgenome contains some
     alternative versions of chromosomes. They are marked with an
     underscore. This function  automatically disregards chromosome
     names with an underscore, and this is known to work for the human
     genome. Nevertheless,  check the output printed to terminal if all
     chromosomes are included. The function is a wrapper around
     findSequenceInGenome,  which can be used for purposes that are
     more flexible (although that function really is just following the
     example in the BSgenome package)

     At present, there is no functionality to check matches with known
     SNP or known splice forms taken into account.

_V_a_l_u_e:

     The ProbeLevelSet class object provided as argument, with all
     double matching probes removed. Double matching probes are probes
     whose sequence are found twice or more in the genome. In addition,
     the output of the matching investigation is saved in the notes of
     the ProbeLevelSet and can be further examined for information on
     the locations of the probe sequences in relation to the BSgenome
     sequences.

_A_u_t_h_o_r(_s):

     Lasse Folkersen

_S_e_e _A_l_s_o:

     'findSequenceInGenome', 'BSgenome', 'excludeDoubleMatchingProbes'

_E_x_a_m_p_l_e_s:

             ## Not run: 
             #you can run this, but it takes a lot of time
             probelevelsetwithnodoubles<-excludeDoubleMatchingProbes(probelevelset)
             
     ## End(Not run)

