View Full Version : What is Phil Working on?
Welcome to the Active Low-Carber Forums
Support for Atkins diet, Protein Power, Neanderthin (Paleo Diet), CAD/CALP, Dr. Bernstein Diabetes Solution and any other healthy low-carb diet or plan, all are welcome in our lowcarb community. Forget starvation and fad diets -- join the healthy eating crowd! You may register by clicking here, it's free!
Philip Dei
Tue, Jul-15-03, 06:12
In my continued effort to provide flamebait for this group .
. .
I have taken a rather, in hindsite painful task.
I have created a program that runs monte-carlo similuations.
Currently I have 12 xlinked popstudies that can be compared
with randomly generated sets based on different assumptions.
The initial runs gave pleasing results however a couple of
issues have bothered me.
Since I figured I had 12 individual author bias would be drawn
out by averages. But that the results of a couple of these
Il2rg and Ids loci would never be hit even by monte carlo.
This is problematic since Monte Carlo analysis could be
imployed to look at specific deviations, such as a particular
MRCA that is say in the 99.9% of ages. IOW I might have 10
loci that fit perfectly and have 2 that are so far in left
feild . . . .
All was going well until I ask the basic question. What is it
that we need to get to know exactly when an MRCA was.
1. The mutation rate (exactly)
2. The precise average of substitutions off the seqMRCA in the
current population.
I have just finished one Loci and I have found all kinds of
things that cause worry.
Plp locus 2 variant positions in humans 50% of sample have 1
variant derivative while 50% have seqMRCA. The difference
between the seqMRCA is 5 to chimpanzee, 10 for gorilla.
The binomial distribution on mutation rates based on 5
sequence differences ranges from .031 to 2.24 times the
calculated rate. For 10 it is a little better. The 96%
confidence range for relative rate differences between chimp
and gorilla is from gorilla evolving 5.5 times that off chimp
to 0.45. Thus these rates might apply to an evolutionary
segment like humans despite the calculated mutation rate.
This basic analysis has determined that a low number of
mutations between chimp and humans gets refactored into the
analysis on several levels. The variants at the loci may not
be reponding to the observed rate of mutation but a real rate
that is faster or slower than would be inferred from the
chimp/human absolute differences if those differences are
small. If there are 5 differences between chimp and human we
assume for instance that each difference represents
3.4 million years, and that since each individuals has 0.5
derivatives then the MRCA is 1.2 million years. But the
variance of potential differences of chimp and gorilla rates
could mean that the difference represents a few hundred
thousand or millions of years.
The basic bottom line. In order for MonteCarlo to be able to
compare human Derived MRCA versus the MRCAs expected in an
idealized model the human factor needs to be factored.
This factor can be broken down
4. Variance in sample size
a. Its effect in predicting the depth of mutations at a
site, and thus range of variation that could explain
observed values.
b. Its effect on hitting the deepest variants. For
example you may have to have 50 african samples to
find the earliest offshoot branch of seqMRCA, this
will affect the MRCA calculation. One can actually
test this by randomly adding a known sample to the
data set and observe how the MRCA decreases with
added sample.
2. Sample site selection, did someone choose a site with a
handful of differences with chimpanzee. How does this
selection affect the external anchoring of mutation rates
3. Observations of potential variance based on differencese
of rates between humans, chimps and gorilla. To get at
this the cloud of low sampling and rate variance needs to
be removed. If the rate is removed how much are the rate
variances between gorilla and chimp be applicable. This
is a human factor because for example C/H distance may be
shorter than H/G arguing for slower evolution on the C
side of a branch and faster on H side, lowering MRCAs.
Each of these human errors can be randomly applied to an
randomly seleted MRCA inorder to give a humanizedMRCA that is
of publishable quality. this can be returned by the
MonteCarlo analysis and retested.
Based on analysis of the data set for which the similated
'humanization' process occurs I found several descrepancies
that I need to report, for the sake of keeping everyone
informed on molecular foul ups.
Xq21.2 (Yu et al) estimated human MRCA at 741 and popsize
about 14000. I thought this was a good paper, until I
started testing the observed differences in mutations C/H
and G/H, the authors should have provided C/G. It is
unfortunated because the C/G mutations are substantially
lowere than G/H and explains why the number of C/H
mutations is about 2/3rds that expected based on number
of G/H mutations. Based on C/G differences, almost all of
the rate variance is probably internal to Pan lineages
and then human rate is comparable to gorilla and
orangutan. This substantially lowers both Yu's MRCA and
the Population size Human factor. Did the human fairly
confidence the rate variance that might apply to the MRCA
and popsize determination. Rate determination in
chimpanzee local groups might determine if chimpanzees
are evolving significantly slower at this loci.
PDAH1. I was looking at their analysis. There rate of
mutations is consistent with other xlinked loci except the
fact that one arm of human haplotypes is evolving faster
(about 2 to 3 times) than the other arm. The slower evolving
arm has a rate that is consistent with other apes, but that
other arm is evolving about 3 times faster. This difference
could be due to a large population in a pre-constriction
period or it could be due to mutation rate variance. Its not
clear, but if the rate variance is applied to the faster
moving arm their MRCA and populations size (1.78 and 27 k
inds) drops to 1/2 that value. Human factor, did the humans
consider that intraspecific rate variation might occur.
Xq21.1. I always have something nice to say about Paabo. This
loci appears not to be evolving in an anomolous or
roller coaster fashion. It is probably most
representative of xlinked, although the sample size is
small (for example, I would like to have seen about 5
more samples from the biaka based on the samples they
presented from the biaka) Human Factor Is this MRCA
potentially deeper, more sample from biaka and closely
related peoples might reveal a deeper seqMRCA.
Xq22.2 factor IX locus. This locus suffers from a low number
of human substitutions, there appears to be balance
between chimp human, Orangutan not gorilla used as
outgroup so rate variation in the local group is hard to
survey. All in all this is not a bad study except that
when one has a small number of variant position one
probably needs to sample heavily where diversity is
deepest for the most reliable average of depth of
diversity. Human factor, inadequate sample of humans
I12rg. 9 difference between chimp and human, 29 gorilla and
human chimp and gorilla pairwise unknown. Means one mutation
for every 1.3 million years in the C/H lines, but 0 are found
in humans in a sample of 10, lol. Get more sample.
PLP. Get More sample, C/H difference to low, large percentage
error with 9 differences.
HPRT. Get More sample, C/H differnce marginal but C/H,
Orangutan/H are proportional to expected ages.
GK. Get More sample. C/H O/H are not proportional, There could
be rate variation at this loci.
Ids. Get More sample. C/H differences(5) has high
precentage variation
dmd7- I haven't got around to these yet
dmd44- I haven't got around to these yet. This is also Yu's
work. Yu collected both dmd 7 and dmd 44 variance, but decided
that dmd 7 MRCA (250 kya) wasn't but dmd44 at about 1 my was
representative, so . . . . . . .
What continues to suprise me is that these studies have a
number of statistics that tell them the sky is up and the
ground is down, and yet they still can't determine their ass
from their elbow. The Xq21.2 rate variation oversight was a
major goof. The conclusions of the paper were determinant on a
given rate and that rate can be shown to be significantly in
error. Also surprising is the continued use of small sample
size and poor outgroups sampling. Gorillas are included or
not, Orangutans or not, bonobos infrequently. It seems to me
that if there is a potency for rate variation, sequencing of
bonobo versus most divergent greater chimp would reveal if
there is something going on in chimp that would give pause to
modify the rate in humans.
Also surprising the number of differences between C/H and
J/H are frequently buried in papers, or often not determined.
Yu's most recent work, which includes some sample of X
chromosome (as well as many autosomals) 50 or so 500 nt
chunks from around the human genome, but no chimp outgroup.
Another surprise. Author will say one thing in one paper and
say a completely different thing in another without ever
questioning the validity of the first or newest paper. For
example, harris and hey propose a larger population size in
PDAH1 paper, but then with the FiX paper show population size
is smaller, Distribution? rate variation? Did they go back
and postulate rate vatiation within human PDHA1 haplotypes?
Nope. Then Yu proposes that human were of recent african
origins but left early, and there is this deep 140 ky branch
in eurasia, eurasion lines could not be from a few african
lines. Next study he shows that africans have 3 times the
diversity of eurasians, and that eurasians could be from a
few african lines.
I don't think Monte Carlo can factor in every human
behavior and bias, but I certainly hope it factors in
enough 'jockeying' of MRCAs in a random fashion to make it
of some use.
How does one distinguish whether a locus is underselection or
there has been a change in the mutation rate at the locus. I
wonder if a test for selection will wack the tester upside the
head and say 'hey look stupid, you've got rate variance at
this locus'.
Deowll
Tue, Jul-15-03, 19:15
Just go to the bottom "Philip Deitiker"
<pdeitik@worldnet.att.net> wrote in message
news:3f149c5a.17806806@netnews.worldnet.att.net...
> In my continued effort to provide flamebait for this group .
> . .
>
> I have taken a rather, in hindsite painful task.
>
> I have created a program that runs monte-carlo similuations.
> Currently I have 12 xlinked popstudies that can be compared
> with randomly generated sets based on different assumptions.
>
> The initial runs gave pleasing results however a couple of
> issues have bothered me.
>
> Since I figured I had 12 individual author bias would be
> drawn out by averages. But that the results of a couple of
> these Il2rg and Ids loci would never be hit even by monte
> carlo. This is problematic since Monte Carlo analysis could
> be imployed to look at specific deviations, such as a
> particular MRCA that is say in the 99.9% of ages. IOW I
> might have 10 loci that fit perfectly and have 2 that are so
> far in left feild . . . .
>
> All was going well until I ask the basic question. What is
> it that we need to get to know exactly when an MRCA was.
> 1. The mutation rate (exactly)
> 2. The precise average of substitutions off the seqMRCA in
> the current population.
>
> I have just finished one Loci and I have found all kinds of
> things that cause worry.
>
> Plp locus 2 variant positions in humans 50% of sample have 1
> variant derivative while 50% have seqMRCA. The difference
> between the seqMRCA is 5 to chimpanzee, 10 for gorilla.
>
> The binomial distribution on mutation rates based on 5
> sequence differences ranges from .031 to 2.24 times the
> calculated rate. For 10 it is a little better. The 96%
> confidence range for relative rate differences between chimp
> and gorilla is from gorilla evolving 5.5 times that off
> chimp to 0.45. Thus these rates might apply to an
> evolutionary segment like humans despite the calculated
> mutation rate.
>
> This basic analysis has determined that a low number of
> mutations between chimp and humans gets refactored into the
> analysis on several levels. The variants at the loci may not
> be reponding to the observed rate of mutation but a real
> rate that is faster or slower than would be inferred from
> the chimp/human absolute differences if those differences
> are small. If there are 5 differences between chimp and
> human we assume for instance that each difference represents
> 2.4 million years, and that since each individuals has 0.5
> derivatives then the MRCA is 1.2 million years. But the
> variance of potential differences of chimp and gorilla
> rates could mean that the difference represents a few
> hundred thousand or millions of years.
>
> The basic bottom line. In order for MonteCarlo to be able to
> compare human Derived MRCA versus the MRCAs expected in an
> idealized model the human factor needs to be factored.
>
> This factor can be broken down
> 1. Variance in sample size
> a. Its effect in predicting the depth of mutations at a
> site, and thus range of variation that could explain
> observed values.
> b. Its effect on hitting the deepest variants. For
> example you may have to have 50 african samples to
> find the earliest offshoot branch of seqMRCA, this
> will affect the MRCA calculation. One can actually
> test this by randomly adding a known sample to the
> data set and observe how the MRCA decreases with
> added sample.
> 2. Sample site selection, did someone choose a site with a
> handful of differences with chimpanzee. How does this
> selection affect the external anchoring of mutation
> rates
> 3. Observations of potential variance based on
> differencese of rates between humans, chimps and
> gorilla. To get at this the cloud of low sampling and
> rate variance needs to be removed. If the rate is
> removed how much are the rate variances between gorilla
> and chimp be applicable. This is a human factor because
> for example C/H distance may be shorter than H/G
> arguing for slower evolution on the C side of a branch
> and faster on H side, lowering MRCAs.
>
>
> Each of these human errors can be randomly applied to an
> randomly seleted MRCA inorder to give a humanizedMRCA that
> is of publishable quality. this can be returned by the
> MonteCarlo analysis and retested.
>
> Based on analysis of the data set for which the similated
> 'humanization' process occurs I found several descrepancies
> that I need to report, for the sake of keeping everyone
> informed on molecular foul ups.
>
> Xq21.2 (Yu et al) estimated human MRCA at 741 and popsize
> about 14000. I thought this was a good paper, until I
> started testing the observed differences in mutations
> C/H and G/H, the authors should have provided C/G. It
> is unfortunated because the C/G mutations are
> substantially lowere than G/H and explains why the
> number of C/H mutations is about 2/3rds that expected
> based on number of G/H mutations. Based on C/G
> differences, almost all of the rate variance is
> probably internal to Pan lineages and then human rate
> is comparable to gorilla and orangutan. This
> substantially lowers both Yu's MRCA and the Population
> size Human factor. Did the human fairly confidence the
> rate variance that might apply to the MRCA and popsize
> determination. Rate determination in chimpanzee local
> groups might determine if chimpanzees are evolving
> significantly slower at this loci.
>
> PDAH1. I was looking at their analysis. There rate of
> mutations is consistent with other xlinked loci except the
> fact that one arm of human haplotypes is evolving faster
> (about 2 to 3 times) than the other arm. The slower evolving
> arm has a rate that is consistent with other apes, but that
> other arm is evolving about 3 times faster. This difference
> could be due to a large population in a pre-constriction
> period or it could be due to mutation rate variance. Its not
> clear, but if the rate variance is applied to the faster
> moving arm their MRCA and populations size (1.78 and 27 k
> inds) drops to 1/2 that value. Human factor, did the humans
> consider that intraspecific rate variation might occur.
>
> Xq13.3. I always have something nice to say about Paabo.
> This loci appears not to be evolving in an anomolous
> or roller coaster fashion. It is probably most
> representative of xlinked, although the sample size
> is small (for example, I would like to have seen
> about 5 more samples from the biaka based on the
> samples they presented from the biaka) Human Factor
> Is this MRCA potentially deeper, more sample from
> biaka and closely related peoples might reveal a
> deeper seqMRCA.
>
> Xq22.2 factor IX locus. This locus suffers from a low number
> of human substitutions, there appears to be balance
> between chimp human, Orangutan not gorilla used as
> outgroup so rate variation in the local group is hard
> to survey. All in all this is not a bad study except
> that when one has a small number of variant position
> one probably needs to sample heavily where diversity is
> deepest for the most reliable average of depth of
> diversity. Human factor, inadequate sample of humans
>
> I12rg. 9 difference between chimp and human, 29 gorilla and
> human chimp and gorilla pairwise unknown. Means one mutation
> for every 1.3 million years in the C/H lines, but 0 are
> found in humans in a sample of 10, lol. Get more sample.
>
> PLP. Get More sample, C/H difference to low, large
> percentage error with 9 differences.
>
> HPRT. Get More sample, C/H differnce marginal but C/H,
> Orangutan/H are proportional to expected ages.
>
> GK. Get More sample. C/H O/H are not proportional, There
> could be rate variation at this loci.
>
> Ids. Get More sample. C/H differences(5) has high precentage
> variation
>
> dmd7- I haven't got around to these yet
>
> dmd44- I haven't got around to these yet. This is also Yu's
> work. Yu collected both dmd 7 and dmd 44 variance, but
> decided that dmd 7 MRCA (250 kya) wasn't but dmd44 at about
> 1 my was representative, so . . . . . . .
>
> What continues to suprise me is that these studies have a
> number of statistics that tell them the sky is up and the
> ground is down, and yet they still can't determine their ass
> from their elbow. The Xq21.2 rate variation oversight was a
> major goof. The conclusions of the paper were determinant on
> a given rate and that rate can be shown to be significantly
> in error. Also surprising is the continued use of small
> sample size and poor outgroups sampling. Gorillas are
> included or not, Orangutans or not, bonobos infrequently. It
> seems to me that if there is a potency for rate variation,
> sequencing of bonobo versus most divergent greater chimp
> would reveal if there is something going on in chimp that
> would give pause to modify the rate in humans.
>
> Also surprising the number of differences between C/H and
> G/H are frequently buried in papers, or often not
> determined. Yu's most recent work, which includes some
> sample of X chromosome (as well as many autosomals) 50 or
> so 500 nt chunks from around the human genome, but no
> chimp outgroup.
>
> Another surprise. Author will say one thing in one paper and
> say a completely different thing in another without ever
> questioning the validity of the first or newest paper. For
> example, harris and hey propose a larger population size in
> PDAH1 paper, but then with the FiX paper show population
> size is smaller, Distribution? rate variation? Did they go
> back and postulate rate vatiation within human PDHA1
> haplotypes? Nope. Then Yu proposes that human were of recent
> african origins but left early, and there is this deep 140
> ky branch in eurasia, eurasion lines could not be from a few
> african lines. Next study he shows that africans have 3
> times the diversity of eurasians, and that eurasians could
> be from a few african lines.
>
> I don't think Monte Carlo can factor in every human
> behavior and bias, but I certainly hope it factors in
> enough 'jockeying' of MRCAs in a random fashion to make it
> of some use.
>
> How does one distinguish whether a locus is underselection
> or there has been a change in the mutation rate at the
> locus. I wonder if a test for selection will wack the tester
> upside the head and say 'hey look stupid, you've got rate
> variance at this locus'.
>
>
I'm sure you're right but the bottom line is the obvious. When
people keep changing their story it means they aren't sure of
the facts. Every piece of genetic material that can be
inherited seperately certainly has been and must been run down
to its most probable point of origin. Fragments of dying
linages may been missed by sampling and while genes have
ancestors and not all fossils have descendents people have
ancestors from whom they have no genes so the whole thing is
still going to be a mess.
Most people stink and math and I think this is part of the
problem. Even when they have good data they are still feeding
in garbage having the comupter do the wrong things with the
data and getting garbage back out. That they don't catch this
is harder to understand. Maybe they do but what to keep on
working in the field.
Bob Keeter
Tue, Jul-15-03, 19:15
"Philip Deitiker" <pdeitik@worldnet.att.net> wrote in message
news:3f149c5a.17806806@netnews.worldnet.att.net...
> In my continued effort to provide flamebait for this group .
> . .
There you go flattering me again! 8-)
> I have taken a rather, in hindsite painful task.
>
You are AMAZING! Perhaps a quick Googlizing of "Philip
Deitiker" in the groups section would be a telling tale! Lets
see where philip has been leaving his little "cigars" of love
and joy over the last few days while he has been doing all of
this high faluting, statistical modeling. . . . . 8-0
Or maybe not. 8-) After all there IS that theory about the
10000 monkeys with 10000 keyboards. . . . ;-)
> I have created a program that runs monte-carlo similuations.
> Currently I have 12 xlinked popstudies that can be compared
> with randomly generated sets based on different assumptions.
>
> The initial runs gave pleasing results however a couple of
> issues have bothered me.
Lets see if I can translate. You have modeled 12
"breeding" populations with some degree of interchange
between the groups?
Statistics dont lie. . . . ..
> Since I figured I had 12 individual author bias would be
> drawn out by averages. But that the results of a couple of
> these Il2rg and Ids loci would never be hit even by monte
> carlo. This is problematic since Monte Carlo analysis could
> be imployed to look at specific deviations, such as a
> particular MRCA that is say in the 99.9% of ages. IOW I
> might have 10 loci that fit perfectly and have 2 that are so
> far in left feild . . . .
Bleep. . . . Lost me! 8-)
> All was going well until I ask the basic question. What is
> it that we need to get to know exactly when an MRCA was.
> 1. The mutation rate (exactly)
> 2. The precise average of substitutions off the seqMRCA in
> the current population.
Ohhhhhh! Say it ain't so! How could such immaterial things as
this be a problem! 8-)
> I have just finished one Loci and I have found all kinds of
> things that cause worry.
>
> Plp locus 2 variant positions in humans 50% of sample have 1
> variant derivative while 50% have seqMRCA. The difference
> between the seqMRCA is 5 to chimpanzee, 10 for gorilla.
Could it be that you are dealing with a far too limited
"subset" of the genetic data? Subsets, even if totally random
have to pass certain "checks" to provide a meaningful
inference. Why do you think that all of those opinion polls
end up with +/- x percentage points? Its usually based on
sampling size, but can also be effected by "selective
sampling".
> The binomial distribution on mutation rates based on 5
> sequence differences ranges from .031 to 2.24 times the
> calculated rate. For 10 it is a little better. The 96%
> confidence range for relative rate differences between chimp
> and gorilla is from gorilla evolving 5.5 times that off
> chimp to 0.45. Thus these rates might apply to an
> evolutionary segment like humans despite the calculated
> mutation rate.
Most troublesome! Does this mean that the "drift rate" even
varies across the width of a single genome, much less across
thousands of years and varying environmental inputs! Geez,
what does THIS mean for the genetic dating of MRCA! 8-)
> This basic analysis has determined that a low number of
> mutations between chimp and humans gets refactored into the
> analysis on several levels. The variants at the loci may not
> be reponding to the observed rate of mutation but a real
> rate that is faster or slower than would be inferred from
> the chimp/human absolute differences if those differences
> are small. If there are 5 differences between chimp and
> human we assume for instance that each difference represents
> 2.4 million years, and that since each individuals has 0.5
> derivatives then the MRCA is 1.2 million years. But the
> variance of potential differences of chimp and gorilla
> rates could mean that the difference represents a few
> hundred thousand or millions of years.
Quite a confidence interval you have working there big feller!
Just remember that wide confidence intervals USUALLY just
reflect an honest assessment of the data at hand! Tight
confidence intervals usually indicate either a huge sampling
effort on moderately variant data (or complicated math errors,
or simple lies!)
> The basic bottom line. In order for MonteCarlo to be able to
> compare human Derived MRCA versus the MRCAs expected in an
> idealized model the human factor needs to be factored.
>
> This factor can be broken down
> 1. Variance in sample size
> a. Its effect in predicting the depth of mutations at a
> site, and thus range of variation that could explain
> observed values.
> b. Its effect on hitting the deepest variants. For
> example you may have to have 50 african samples to
> find the earliest offshoot branch of seqMRCA, this
> will affect the MRCA calculation. One can actually
> test this by randomly adding a known sample to the
> data set and observe how the MRCA decreases with
> added sample.
> 2. Sample site selection, did someone choose a site with a
> handful of differences with chimpanzee. How does this
> selection affect the external anchoring of mutation
> rates
> 3. Observations of potential variance based on
> differencese of rates between humans, chimps and
> gorilla. To get at this the cloud of low sampling and
> rate variance needs to be removed. If the rate is
> removed how much are the rate variances between gorilla
> and chimp be applicable. This is a human factor because
> for example C/H distance may be shorter than H/G
> arguing for slower evolution on the C side of a branch
> and faster on H side, lowering MRCAs.
Welcome to the real world!
> Each of these human errors can be randomly applied to an
> randomly seleted MRCA inorder to give a humanizedMRCA that
> is of publishable quality. this can be returned by the
> MonteCarlo analysis and retested.
And sometimes I thought that I wrote in babbelized and
indecipherable jargon. . . . but grammatically diagram the
above sentence!
> Based on analysis of the data set for which the similated
> 'humanization' process occurs I found several descrepancies
> that I need to report, for the sake of keeping everyone
> informed on molecular foul ups.
Philip, this is just too good of a set up line! I cant pass it
up, but honor does not allow me to take it! Please! SOMEONE
pick up on this one! On one hand you have Philip's "noble"
efforts and on the other you have his own terminology of
"molecular foul ups"! Pick up the ball and run! 8-))))))
> Xq21.2 (Yu et al) estimated human MRCA at 741 and popsize
> about 14000. I thought this was a good paper, until I
> started testing the observed differences in mutations
> C/H and G/H, the authors should have provided C/G. It
> is unfortunated because the C/G mutations are
> substantially lowere than G/H and explains why the
> number of C/H mutations is about 2/3rds that expected
> based on number of G/H mutations. Based on C/G
> differences, almost all of the rate variance is
> probably internal to Pan lineages and then human rate
> is comparable to gorilla and orangutan. This
> substantially lowers both Yu's MRCA and the Population
> size Human factor. Did the human fairly confidence the
> rate variance that might apply to the MRCA and popsize
> determination. Rate determination in chimpanzee local
> groups might determine if chimpanzees are evolving
> significantly slower at this loci.
>
> PDAH1. I was looking at their analysis. There rate of
> mutations is consistent with other xlinked loci except the
> fact that one arm of human haplotypes is evolving faster
> (about 2 to 3 times) than the other arm. The slower evolving
> arm has a rate that is consistent with other apes, but that
> other arm is evolving about 3 times faster. This difference
> could be due to a large population in a pre-constriction
> period or it could be due to mutation rate variance. Its not
> clear, but if the rate variance is applied to the faster
> moving arm their MRCA and populations size (1.78 and 27 k
> inds) drops to 1/2 that value. Human factor, did the humans
> consider that intraspecific rate variation might occur.
>
> Xq13.3. I always have something nice to say about Paabo.
> This loci appears not to be evolving in an anomolous
> or roller coaster fashion. It is probably most
> representative of xlinked, although the sample size
> is small (for example, I would like to have seen
> about 5 more samples from the biaka based on the
> samples they presented from the biaka) Human Factor
> Is this MRCA potentially deeper, more sample from
> biaka and closely related peoples might reveal a
> deeper seqMRCA.
>
> Xq22.2 factor IX locus. This locus suffers from a low number
> of human substitutions, there appears to be balance
> between chimp human, Orangutan not gorilla used as
> outgroup so rate variation in the local group is hard
> to survey. All in all this is not a bad study except
> that when one has a small number of variant position
> one probably needs to sample heavily where diversity is
> deepest for the most reliable average of depth of
> diversity. Human factor, inadequate sample of humans
>
> I12rg. 9 difference between chimp and human, 29 gorilla and
> human chimp and gorilla pairwise unknown. Means one mutation
> for every 1.3 million years in the C/H lines, but 0 are
> found in humans in a sample of 10, lol. Get more sample.
>
> PLP. Get More sample, C/H difference to low, large
> percentage error with 9 differences.
>
> HPRT. Get More sample, C/H differnce marginal but C/H,
> Orangutan/H are proportional to expected ages.
>
> GK. Get More sample. C/H O/H are not proportional, There
> could be rate variation at this loci.
>
> Ids. Get More sample. C/H differences(5) has high precentage
> variation
>
> dmd7- I haven't got around to these yet
>
> dmd44- I haven't got around to these yet. This is also Yu's
> work. Yu collected both dmd 7 and dmd 44 variance, but
> decided that dmd 7 MRCA (250 kya) wasn't but dmd44 at about
> 1 my was representative, so . . . . . . .
>
> What continues to suprise me is that these studies have a
> number of statistics that tell them the sky is up and the
> ground is down, and yet they still can't determine their ass
> from their elbow.
I will have to remember this one! Good little turn with
the words!
> The Xq21.2 rate variation oversight was a major goof. The
> conclusions of the paper were determinant on a given rate
> and that rate can be shown to be significantly in error.
> Also surprising is the continued use of small sample size
> and poor outgroups sampling. Gorillas are included or not,
> Orangutans or not, bonobos infrequently. It seems to me that
> if there is a potency for rate variation, sequencing of
> bonobo versus most divergent greater chimp would reveal if
> there is something going on in chimp that would give pause
> to modify the rate in humans.
Small sample sizes and poor outgroup samplings DO make
'discoveries' much easier! Anybody can take the time to
accumulate enough samples for statistical significance; it
takes genius to pull meaningful information out of what is
statistically "white noise"! Guess its that "meaningful"
issue again!
;-)
> Also surprising the number of differences between C/H and
> G/H are frequently buried in papers, or often not
> determined. Yu's most recent work, which includes some
> sample of X chromosome (as well as many autosomals) 50 or
> so 500 nt chunks from around the human genome, but no
> chimp outgroup.
>
> Another surprise. Author will say one thing in one paper and
> say a completely different thing in another without ever
> questioning the validity of the first or newest paper.
8-) Has to do with honesty, personal integrity, and
professionalism I suppose! Sometimes the "least impact" would
be the simple statement that "I screwed up!". Guess that just
does not publish well. 8-)
Or is that perhaps where ego comes into conflict with science.
. . but we would not know anything about that!
> For example, harris and hey propose a larger population size
> in PDAH1 paper, but then with the FiX paper show population
> size is smaller, Distribution? rate variation? Did they go
> back and postulate rate vatiation within human PDHA1
> haplotypes? Nope. Then Yu proposes that human were of recent
> african origins but left early, and there is this deep 140
> ky branch in eurasia, eurasion lines could not be from a few
> african lines. Next study he shows that africans have 3
> times the diversity of eurasians, and that eurasians could
> be from a few african lines.
>
> I don't think Monte Carlo can factor in every human
> behavior and bias, but I certainly hope it factors in
> enough 'jockeying' of MRCAs in a random fashion to make it
> of some use.
8-)
> How does one distinguish whether a locus is underselection
> or there has been a change in the mutation rate at the
> locus. I wonder if a test for selection will wack the tester
> upside the head and say 'hey look stupid, you've got rate
> variance at this locus'.
Er.. . . . some potential answers!
1. You CANT distinguis whether a lucus is under selection or
if there has been a change in the nutation rate at the
locus unless you have a full and fairly complete "history"
of that locus over the period you are interested in. You
might get an idea of the overall "relative rate" if you
compare one locus of some nominal length with another
locus of equivalent length between two samples.
For example if you took a 1000 element locus from chimp and
human and compared the number of "deltas" to a separate 1000
element locus from a different location in the genome, if the
rate of mutation and selective pressures were similar for
those two segments, you would expect to see the same number
of "deltas". Since mutation at least would be a nice random
function, there might still be some differences. So. . . you
are faced with the option of comparing other 1000 element
loci to see if its all "random scatter" or of one or another
loci separates itself from the others either by a dearth of
deltas or a huge increase. Carried to the extreme, take a 16k
element run (like mtDNA) clip it into 100 element strings and
count the deltas between the two samples for each segment.
Clip it into 1000 element loci and re-count the deltas.
Figure out what the
:"average" for the 1000 element loci is per element. Any of
:the 100 element
loci with SIGNIFICANTLY greater that the average are probably
areas where SOMETHING is messing with the selection or
mutation effects.
Perhaps it would also "whack" the investigator up beside the
head and say, "Hey stupid, DNA mutation rates are not constant
at all!" What do you think?
REgards bk
Philip Dei
Tue, Jul-15-03, 19:15
On Tue, 15 Jul 2003 12:35:36 -0500, "deowll"
<deowll@bellsouth.net> wrote:
>I'm sure you're right but the bottom line is the obvious.
>When people keep changing their story it means they aren't
>sure of the facts. Every piece of genetic material that can
>be inherited seperately certainly has been and must been run
>down to its most probable point of origin.
From my point of view, and I suppose this is a bit of a
warning, I only noticed some of these anomalies because I
began searching for some obscure tidbits of information.
> Fragments of dying linages may been missed by sampling and
> while genes have ancestors and not all fossils have
> descendents people have ancestors from whom they have no
> genes so the whole thing is still going to be a mess.
Yes, I certainly wonder about some of the sample sizes, but
most of the rate variation I see points to more recent MRCAs
among those that plot off the line. Albeit they are not far
off the line, but my Z score based Monte Carlo analysis is
sensitive enough to argue that in the context of the other
older MRCAs that the PDHA1 creates as situation with a higher
than expected Z-score. I cannot do anything about the MRCA but
what I can do is plug rate variance into the random selection
of MRCAs in order to account for that source of variance. By
and large, unless you accept that the world wide population of
hominids outside of africa was 1 or 2000 individuals
constantly streaming from africa and replacing the genetic
makeup of the peripheral world groups, the introduction of DNA
would have to have been limited to literally a handful of
intermating events.
>Most people stink and math and I think this is part of
>the problem.
I think plugging the data into present computer programs is a
good way of shielding oneselves from seeing the nuances is
the data set.
> Even when they have good data they are still feeding in
> garbage having the comupter do the wrong things with the
> data and getting garbage back out.
I wouldn't say wrong, but if you've got one line that has 10
mutations in it and a second that only has 3, and the 3 rate
is places your MRCA on par with the rate based expectation and
the 10 rate places the MRCA back into the pliocene, and even
though the distribution of MRCA still is tolerant, but then
you go out say know the population size was 27000. I would say
that is not a fair consideration of all the possible even more
likely explanations. Showing that even intraspecific loci
specific evolutionary rates can shift is as important as
trying to confuscate MRCA and popsize issues.
>That they don't catch this is harder to understand. Maybe
>they do but what to keep on working in the field.
Possibly, drag things out for 30 years when 2 or 3 real good
studies would do. I think a global profiling of Xq13.3 would
be real nice, similar to that of mtDNA HVR1. Get 2 or 3 more
loci with non-anomolous evolution and I think you have a
relatively clear vision of at least female evolution (combined
with mtDNA). Anyway it aint going to happen, we have a zigzag
line of improvments and I think within a few improvements that
that poor quality works will be obvious and can be dropped. In
the mean time looks like I need to play a little Monte Carlo.
Pete
Wed, Jul-16-03, 06:11
Bob Keeter wrote:
>
> "Philip Deitiker" <pdeitik@worldnet.att.net> wrote in
> message news:3f149c5a.17806806@netnews.worldnet.att.net...
> > In my continued effort to provide flamebait for this
> > group .
> > . .
>
> There you go flattering me again! 8-)
>
> > I have taken a rather, in hindsite painful task.
> >
>
> You are AMAZING! Perhaps a quick Googlizing of "Philip
> Deitiker" in the groups section would be a telling tale!
> Lets see where philip has been leaving his little
> "cigars" of love and joy over the last few days while he
> has been doing all of this high faluting, statistical
> modeling. . . . . 8-0
>
> Or maybe not. 8-) After all there IS that theory about the
> 10000 monkeys with 10000 keyboards. . . . ;-)
http://www.geocities.com/Area51/Portal/9039/sounds/burns-mo.au
--
pete
Bob Keeter <rkeeter@earthlink.net> wrote:
> 1. You CANT distinguis whether a lucus is under selection
> or if there has been a change in the nutation rate at
> the locus unless you have a full and fairly complete
> "history" of that locus over the period you are
> interested in. You might get an idea of the overall
> "relative rate" if you compare one locus of some nominal
> length with another locus of equivalent length between
> two samples.
>
Wow, all CAPS! Seems to indicate that you're all
CONFIDENT eh?D
OTH it is pretty basic knowledge, that the mutation rate is
not constant across the entire genome. Hence it would be
pointless to compare two different loci.
There are other procedures to solve the problem in question.
> .... take a 16k element run (like mtDNA) ...
That reminds me on a question:
/*
What exactly do you call a "nominal distance"? What distances
do you have observed in that set I recommended you?
*/
Lets see if you're still in the A.G. mode.
Michael
mb <cai@pirin.ha> wrote:
> /*
>
> What exactly do you call a "nominal distance"? What
> distances do you have observed in that set I
> recommended you?
>
> */
>
> Lets see if you're still in the A.G. mode.
It seems you are - Well and my respect for you is reaching
P.D. level.
Michael
Philip Dei
Fri, Jul-18-03, 19:15
On Fri, 18 Jul 2003 20:18:46 +0200, scho@kola.de (mb) wrote:
>It seems you are - Well and my respect for you is reaching
>P.D. level.
Its nice to see I am the topic of at least some, if not
repetitive, discussion.
Have you ever heard of trying to squeeze blood from a turnip.
I tried to help him, Gisele tried to help him, your trying
to help. You might come to the conclusion that his
attempted dissection of these sequences is nothing more
that fluff and show, he thought something different was
there, but when he looked into and couldn't find what he
was looking for, he simply started stalling and delay the
presentation of an inevitable opinion, all the worse it
probably would be in agreement with what I presented, a
fate for him worse than death.
Bob Keeter
Sat, Jul-19-03, 19:14
"Philip Deitiker" <pdeitik@bcm.tmc.edu> wrote in message
news:kjoghv4evqhpvunog2pjduj93vts8d4m80@4ax.com...
> On Fri, 18 Jul 2003 20:18:46 +0200, scho@kola.de (mb) wrote:
>
> >It seems you are - Well and my respect for you is reaching
> >P.D. level.
>
> Its nice to see I am the topic of at least some, if not
> repetitive, discussion.
>
> Have you ever heard of trying to squeeze blood from a
> turnip.
>
> I tried to help him, Gisele tried to help him, your trying
> to help. You might come to the conclusion that his
> attempted dissection of these sequences is nothing more
> that fluff and show, he thought something different was
> there, but when he looked into and couldn't find what he
> was looking for, he simply started stalling and delay the
> presentation of an inevitable opinion, all the worse it
> probably would be in agreement with what I presented, a
> fate for him worse than death.
8-)
Philip, old boy, how many times have I repeated that I SUSPECT
you probably do know at least some of the science you try to
spout? I simply cant tell what to believe and what to
disbelieve without looking for myself. Cant figure out where
that "lie for the sake of your USENET personna" ends and where
honest science begins! Frankly, if you were to say that the
sun was rising tomorrow morning, I would feel the need to get
up early and check (in spite of my own personal beliefs!). 8-)
But you see, that is a critical part of the "Poster Boy" act.
Oil and water CAN mix! You toss in a few spices and such,
shake them up and you can make a nice little salad dressing.
Take personal dishonesty, prejudice, hate, venomous invective;
mix those with honest science through the shaking of "USENET",
or in any other public forum, and the result is a lot less
palatable! Embarassing in fact. But then you said yourself
that you have no shame.
SO. . . . keep up the good work! Let everyone see your
"version" of a professional at work! PLEASE! If it were not
for you and Marc (and a few others of your ilk), my little
personal "windmill" would be very difficult to even present
much less debate! With you two, it becomes a "slam dunk"!
Of course, without you, I would not have a windmill either.
. . .! 8-)
!!!!!!!
Atta boy, Poster Boy!
Regards bk
Copyright 2000-2009 Active Low-Carber Forums @ forum.lowcarber.org
vBulletin, Copyright ©2000-2009, Jelsoft Enterprises Ltd.