The theory of stratified sampling deals with the properties of the sampling distributions of the estimators and with different types of sampling. Survey weights (also called sampling weights or probability weights) indicate how many people in a finite population an observation in a survey represents. The weighted arithmetic mean is similar to an ordinary arithmetic mean (the most common type of average), except that instead of each data point contributing equally to the final average, some data points contribute more than others. The notion of a weighted mean plays a role in descriptive statistics and also occurs in a more general form in several other areas of mathematics. Reservoir-type uniform sampling algorithms over data streams are discussed in [12]. When the population is heterogeneous, dividing the whole population into sub-populations, called strata, can increase the precision of the estimates. A self-weighting sample, usually with respect to the total for the entire population, is generally incorporated into a sample design to simplify tabulation work, because the population total is then proportional to the sample total. The strata should not overlap, and each stratum should be sampled following some design. In weighted random sampling (WRS), the items are weighted and the probability of each item being selected is determined by its relative weight. Investigators are often interested in estimating quantities (such as means, counts, or proportions) in a population by using a representative sample selected from that population. The most straightforward type of probability sampling design, a simple random sample (SRS), is a selection method in which each possible sample of a given size has the same probability of being selected. Sample weights are created, and both weighted and unweighted means are calculated. A variable named "score" is created with different means for Regions A and B.
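The difference between weighted and unweighted means in a stratified sample can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the population sizes, the score distributions, and the sample sizes for Regions A and B are invented for the example, and each weight is taken to be the stratum size divided by the stratum sample size (the number of population units a sampled unit represents).

```python
import random

random.seed(0)  # fixed seed so the sketch is reproducible

# Hypothetical population (assumption): Region A is large with lower scores,
# Region B is small with higher scores.
population = {
    "A": [random.gauss(50, 5) for _ in range(9000)],
    "B": [random.gauss(80, 5) for _ in range(1000)],
}

# Deliberately unequal sampling fractions: the same sample size in each
# region, so Region B is over-represented in the raw sample.
sample_sizes = {"A": 100, "B": 100}

sample = []  # list of (region, score, weight) tuples
for region, scores in population.items():
    n = sample_sizes[region]
    # Sampling weight: population units represented by one sampled unit.
    weight = len(scores) / n
    for score in random.sample(scores, n):  # SRS within the stratum
        sample.append((region, score, weight))

# The unweighted mean treats every sampled unit equally.
unweighted_mean = sum(s for _, s, _ in sample) / len(sample)

# The weighted mean lets each unit count in proportion to its weight.
weighted_mean = sum(w * s for _, s, w in sample) / sum(w for _, _, w in sample)

# Population mean, available here only because the population is simulated.
true_mean = sum(sum(v) for v in population.values()) / sum(
    len(v) for v in population.values()
)

print(f"unweighted: {unweighted_mean:.2f}")
print(f"weighted:   {weighted_mean:.2f}")
print(f"population: {true_mean:.2f}")
```

Because Region B makes up half of the sample but only a tenth of the simulated population, the unweighted mean is pulled toward Region B's higher scores, while the weighted mean stays close to the population mean.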
(a) Audit sampling (sampling) – The application of audit procedures to less than 100% of items within a population of audit relevance such that all sampling units have a chance of selection in order to provide the auditor with a reasonable basis on which to draw conclusions about the entire population. In an SRS, each member of the population has the same probability of selection. When data must be weighted, try to minimize the sizes of the weights. In some cases, the weight of a given unit may be interpreted as the number of units from the population that are represented by this sample unit. In statistics, quality assurance, and survey methodology, sampling is the selection of a subset (a statistical sample) of individuals from within a statistical population to estimate characteristics of the whole population. WRS can be defined by an algorithm referred to as algorithm D. Sampling of discharges: a grab sample is taken from a waste stream without regard to the flow of the waste stream and over a period of time not to exceed 15 minutes. A composite sample is prepared by combining a series of grab samples. All strata must be sampled. The strata are sampled separately, and the estimates from each stratum are combined into one estimate for the whole population. Cluster sampling is a sampling method in which the researcher divides a population into multiple clusters that share homogeneous characteristics and have an equal chance of being part of the sample. Quota sampling is a sampling methodology in which data is collected from a homogeneous group. The main advantage of stratified random sampling is that it captures key population characteristics in the sample. To define a k-mer ordering needed for weighted minimizer sampling, we borrow the optimized hashing technique. Audit sampling is defined as the application of an audit procedure to less than 100% of the items within a population of audit relevance. Hence, auditors need to use sampling techniques.
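As a rough illustration of weighted random sampling, the sketch below draws items one at a time without replacement, selecting each remaining item with probability equal to its weight divided by the total weight of the items still available. This is one common reading of "probability determined by relative weight" and is an assumption here; it is not claimed to be the algorithm D mentioned above. The function name and its parameters are hypothetical.

```python
import random


def weighted_sample_without_replacement(items, weights, m, rng=random):
    """Draw m distinct items (illustrative sketch, not a reference algorithm).

    In each round, every remaining item is chosen with probability
    weight / (sum of the weights of the remaining items).
    """
    if len(items) != len(weights):
        raise ValueError("items and weights must have the same length")
    if any(w <= 0 for w in weights):
        raise ValueError("weights must be positive")
    if m > len(items):
        raise ValueError("cannot draw more items than are available")

    remaining_items = list(items)
    remaining_weights = list(weights)
    chosen = []
    for _ in range(m):
        # random.choices normalizes the weights internally.
        idx = rng.choices(
            range(len(remaining_items)), weights=remaining_weights, k=1
        )[0]
        # Remove the selected item so it cannot be drawn again.
        chosen.append(remaining_items.pop(idx))
        remaining_weights.pop(idx)
    return chosen


# Usage: item "d" has four times the weight of item "a".
picked = weighted_sample_without_replacement(
    ["a", "b", "c", "d"], [1, 2, 3, 4], m=2, rng=random.Random(42)
)
print(picked)
```

Each round costs time linear in the number of remaining items, which is fine for small populations; streaming or very large inputs would call for a different approach, such as the reservoir-type methods mentioned earlier.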
Population refers to any group of records or documents of audit relevance that belong to a specific category. It is generally not feasible to review every record of the client.