
Re: [Swarm-Modelling] Parameter-fitting and model comparison methods for ABM?


From: chris rohe
Subject: Re: [Swarm-Modelling] Parameter-fitting and model comparison methods for ABM?
Date: Sun, 01 Jun 2003 18:33:07 +0000

Darren,
I would like to request a copy of your paper on evaluating models. I recently completed my M.A., which focused on creating predictive models for prehistoric archaeological sites in the Rocky Mountains using Boolean, weighting, and regression models. I am very interested in agent-based and neural network approaches, but completely lost in their methods, and I hope to continue learning new ideas and techniques in this area. Please let me know if I can get a copy of that paper; it would be best to send it to my work email, address@hidden. Thanks
Chris Rohe
Assistant Director of Cartography and Geospatial Technologies
Statistical Research, Inc.


From: Darren Schreiber <address@hidden>
Reply-To: address@hidden
To: address@hidden
Subject: Re: [Swarm-Modelling] Parameter-fitting and model comparison methods for ABM?
Date: Thu, 29 May 2003 04:42:15 -0700

This is a good opportunity to bounce around an idea I have been toying with for about a year.

Imagine if we were to use a notation:

y = m(α, β, x)

where m means method (akin to f meaning function), α is the algorithm used, β is the set of parameter settings, x is the inputs to the model, and y is the outputs from the model. We could then think about evaluating the model fit of ABMs in a manner similar to evaluating model fit for regression models.
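To make that concrete, here is a minimal sketch in Python of what treating a model as m(α, β, x) and scoring its fit against observations might look like; the helper names are hypothetical, not anything Swarm-specific:

    # Minimal sketch of y = m(alpha, beta, x); names are illustrative placeholders.
    def m(alpha, beta, x):
        # Run method alpha with parameter settings beta on inputs x; return outputs y.
        return alpha(x, **beta)

    def fit_score(alpha, beta, x, observed):
        # Score the model outputs against observed phenomena (simple squared error).
        y = m(alpha, beta, x)
        return sum((yi - oi) ** 2 for yi, oi in zip(y, observed))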

In a paper on model evaluation (aka validation), I argue for a four-part ontology connected to a list of 20 or so evaluation techniques. My proposed ontology is Theory--Model--Phenomena--Noumena. In this ontology, the theory is the idea in your head about how the world really works. The model is the expression of the theory in a concrete form. The phenomena are the set of observations we have. And the noumena (using a term from Kant) are things as they actually are (Truth with a capital T). My distinction between theory and model comes out of my background as an attorney: I cannot copyright my great American novel (theory) until I have written it (model).

In typical regression models, we look for the parameters β that minimize the residuals (or something like this, depending on the estimator), but we don't tinker with the functional form of the equation. In evaluating ABMs, however, we can tinker with both the algorithms and the parameters in order to adjust the fit between the theory and the phenomena. It isn't hard to imagine that a sufficiently complex problem might even warrant using genetic programming to create myriad swarm simulations in order to find the algorithm and set of parameters that best fit the problem. The very clever next step (suggested by John Miller at CMU) would be to then run another search to see how easy it is to "break" the model, in order to evaluate the robustness of the resulting algorithm/parameter set. Great models would be ones that both fit the data and are robust in fitting it. You would probably also want to partition your dataset so that you do exploratory analysis on one part and confirmatory analysis on the other.
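As a rough sketch of that search-then-break procedure (reusing the m() and fit_score() helpers above; the perturb argument, a function that jitters a parameter set, is hypothetical):

    def exploratory_search(algorithms, parameter_grid, x, observed):
        # Exploratory step: find the algorithm/parameter pair with the best fit.
        best = None
        for alpha in algorithms:
            for beta in parameter_grid:
                score = fit_score(alpha, beta, x, observed)
                if best is None or score < best[0]:
                    best = (score, alpha, beta)
        return best

    def robustness_check(alpha, beta, x_confirm, observed_confirm, perturb, trials=100):
        # Confirmatory step on held-out data: how badly does the fit degrade
        # when the chosen parameter set is jittered?
        return max(fit_score(alpha, perturb(beta), x_confirm, observed_confirm)
                   for _ in range(trials))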

Statistical theory about evaluating regression models is hitting something of a quandary now because of the computer revolution. It used to be that the 95% confidence interval made a lot of sense as a rule of thumb, because running a regression model took a big stack of punch cards and a lot of mainframe time. But some statistical packages now have the power to search the possible combinations of variables and functional forms to find the best fits with the data. And other kinds of data mining have become an important (even life-saving) art. But if I have run three hundred possible models against my data set, it would not be shocking if fifteen of them reach significance at the .05 level even if the data set is just random numbers. Some have argued that sloppy statistical practices are probably leading to spurious-result rates of 50% in some disciplines.
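That arithmetic is easy to check. A tiny simulation (numpy and scipy, nothing exotic) that regresses pure noise on pure noise three hundred times should come back with roughly fifteen "significant" results on average:

    import numpy as np
    from scipy.stats import linregress

    rng = np.random.default_rng(0)
    trials, hits = 300, 0
    for _ in range(trials):
        x = rng.normal(size=100)
        y = rng.normal(size=100)      # no relationship whatsoever
        if linregress(x, y).pvalue < 0.05:
            hits += 1
    print(hits, "of", trials, "pure-noise regressions are 'significant' at .05")
    # Expect about 5% of them, i.e. roughly 15, purely by chance.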

Imagine I mine the hell out of my data and, using eight parameters, finally get a .05 result from my data set. You kind of think my theory is kooky, but since most journals aren't publishing null findings, you feel obliged to start with my kooky theory and data mine from there until you also meet the .05 threshold. It isn't hard to see how you could build an entire line of research on nothing but totally random numbers and still meet all of the typical standards for publication.

The problem is being compounded by the escalation in types of estimators and models (OLS, GLM, probit, logit, scobit, MLE, MCMC, neural networks, ABMs). If I can't get .05 results from my data with OLS, should I just move to another form? When methodologists lose sight of the substantive problems, it is easy to just drift into more and more arcane methods.

A related problem is that if I let the number of parameters in a regression model proliferate, I have quickly defined a nearly incomprehensible high-dimensional space where robustness may be a huge unknown and theoretical understanding is almost certainly lost. The political scientist Chris Achen has thus argued for "ART: A Rule of Three": you had better have a damned good reason for having more than three variables in your regression model.

As I have seen how the problems facing statistics are starting to cause an unraveling of traditional practice and theory, it has given me increasing confidence that evaluating agent-based models does not have to be relegated to black-art status. In fact, our consciousness of the flexibility of our algorithms and parameters may even give us a conceptual advantage in getting ahead of these problems, and perhaps even in providing leadership to other types of modelers.

In my current thinking, modeling of all sorts (statistical, formal, computational, qualitative, narrative, interpretive dance, etc.) faces a common set of problems. The Matrix as a movie may get a positive evaluation because it resonates with some "real" truths (in the noumenal sense). Battlefield Earth (a horrifyingly bad film) fails because its plot, characters, technology, and acting are all completely unbelievable, or at least not worthy of belief or suspended disbelief. Models work when they address a problem. As Jane Azevedo argues in "Mapping Reality", not all models are appropriate for all problems, and models can only be evaluated in the context of the problem they are addressing. A topographic map is great for hiking but terrible for navigating the London Underground.

To get around the problems we face in evaluating agent-based models, we need to return to some first principles of epistemology and metaphysics, we need to develop best practices that fit our current problems and technology and their foreseeable extensions, and we need to develop some community consensus about those best practices. My prediction is that within a few years the problems of current statistical practice (publication based upon asterisks, thoughtless superpowered data mining, no null findings, no replication studies, no substantive interpretation of coefficients) will hit the academic headlines and cause a big rethink. Agent-based modeling is going to have to do that thinking now to move ahead. And just borrowing the rubric from standard stats isn't going to do it.

Steve, I am sending you a copy of my evaluating-models paper under separate cover. I would appreciate feedback on it, since I am going to do a major rework of it in the fall. Anyone who would like a copy, please ask. It has more coherence than this late-night (early-morning) set of ramblings.

        Darren



On Wednesday, May 28, 2003, at 06:27 AM, Steve Railsback wrote:

Hi-

I am trying to write up some recommendations for analysis of agent-based
models, and am not sure what to say about traditional methods for (1)
fitting model parameters to data and (2) comparing model versions by
their ability to fit data.

In the stochastic simulation literature (e.g., Law and Kelton 1999)
there is discussion of some traditional techniques like maximum
likelihood estimation; but I've never seen such techniques (also
including Akaike's Information Criterion and Bayesian analysis) applied
to agent-based models. Instead, the few examples of parameter-fitting
I've found use a simple filtering process: simulate a billion
alternative parameter sets and identify the ones that produce acceptable
results.
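In rough pseudocode that filtering step is just the following, where run_model and acceptable are placeholders for the ABM and the acceptance criterion rather than any particular package:

    def filter_parameters(candidate_sets, run_model, acceptable):
        # Keep only the candidate parameter sets whose simulated output is acceptable.
        return [params for params in candidate_sets if acceptable(run_model(params))]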

Lacking a background in statistics, I can't help wondering if there are
not some fundamental obstacles to using these traditional techniques for
ABMs, due to things like:

- The observed relation of a particular output to a particular input
potentially being extremely weird and 'noisy' even if the input has a
simple, strong effect on the agent behavior that produces the output.

- The large number of pathways by which a system can arrive at a
particular state?

- With AIC, the analysis depends heavily on the number of parameters in
the model, which by itself could be a very interesting problem. What
really constitutes a 'parameter'? (The formula is sketched just after
this list.)

- Does the conventional equating of degrees of freedom with # of
parameters make sense?
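For reference, the usual form of the criterion, where all of the ambiguity above sits in choosing k:

    def aic(k, max_log_likelihood):
        # Akaike's Information Criterion: AIC = 2k - 2 ln(L_hat), with k the number
        # of fitted parameters; smaller is better when comparing models on the same data.
        return 2 * k - 2 * max_log_likelihood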

Does anyone have any literature, experience, understanding??

Thanks,

Steve Railsback

Lang, Railsback & Assoc.
Arcata CA

_______________________________________________
Modelling mailing list
address@hidden
http://www.swarm.org/mailman/listinfo/modelling





