Tag: Data Science

  • Elmer Fudd on a Pilgwimage

    Context is a rabbit hole, and I am NOT Elmer Fudd. I caught the Rabbit.

    It wasn’t easy. The problem started when I actually had to figure out what Context is rather than what it does. It’s complicated.

    As a tool, Context is like a hammer. You use it but you don’t ponder much about it. But it’s a shifty, enigmatic thing, like a hammer the first time you see it that becomes a vase of flowers after you blink

    BI limiting Context to my project & the problem of identifying Story Narratives in a website, I figured it out. Here is a few things about Context:

    • You can impose your own context, like writing a business meeting Agenda. Or it can emerge from a conversation that goes from meandering to fix on a compelling point. I call this Deterministic & Incidental context.

    • A single tag often has multiple meanings, Data Scientists call that Ambiguous. Joining it with a related tag narrows the meaning, eventually resolving it to one, what Data Scientists call Disabmbiguation.
    • 2 ore more tags are related when they occur in more than one post. I call this The Story Principle.
    • 2 or more related tags are a Context. Sometimes, I call that group of tags a Storyline.
    • A group of 2-4 tags suggest a thing. More than that suggest multiple things. Many multiple things. A group of tags may hold many clusters of tags. This is Convergence and divergence, Clustering.
    • Tags in one post join many different tags in many other posts. There could be hundreds of clusters in even a small web-site. I call this Proliferation though some would argue it’s Polymorphy.
    • Tags are promiscuous, combining, conflating, comingling with each other. Each tag has many facets, many faces. Tags & clusters of tags are Polymorphic. This means yuo need multiple tags to resolve to a clear context.
    • We have explicit context, a structure of nested tags, but we also have implicit context. You may see Apple, Orange as fruit, but implicit to this website, they are a Context example. The venue as another context as opposed to the one in your mind is a Scope.
    • You can have multiple contexts, nested within each other in some order. Data Science calls this rule of order a Schema. This is a hierarchy but could also be a Breadcrumb or Branch of a tree, or a Chain.
    • There is no fixed order, or scheme for nested, hierarchical contexts. Whether you manually build one or whether an algorithm uses a clever method to do so. The last point is why nobody can just build a huge tree of Everything even if we have the capability. Nobody can agree on the Schema.
    • I use tag weights, how manyposts they occur in as the means of consistently ordering groups of tags. Honestly, this is a very weak justification & results in novel & interesting juxtapositions of concepts, but it is a consistent, understandable way to create these hierarchies.
  • What is Simili?

    Simili is the movement to create the Internet Library for Humanity, Terra Cognos.

    The Blind Guy, Paul Wolborsky, is founder, original architect & developer for the infrastructure & essential technologies.

    The Internet is the greatest treasure Humankind ever owned, but its knowledge & data are all dissolute, incoherent, siloed, raw nuggets buried in mountains of content. Google built a trillion dollar industry around keyword search & ChatGPT/Gemini are a great step forward, but those are proprietary technologies & the model resembles some guy who goes into the back room to get a thing you ask for. We need an open framework that makes knowledge & wisdom available to everybody yet still generate wealth for those who build it.

    To create this great Library, Simili will create an array of technologies that will generate indexes, tables of contents, categories, hierarchical contexts, tags, data sets and browsing tools to help all humans gain knowledge & wisdom from the web.

    We start with a WordPress plugin, Simili Trees, which implements the first core concepts. Then, we create the Simili SearchSpace that will initially serve WordPress websites that install the plugin. Then, we will build more client-side applications for other websites, big & small.

    The Terra Cognos will consiste of one or more colonies with a hice & swarm based on blockchain and digital currency.

    The Terra Cognos will embrace Data Science, Content Mining, RDF knowlege and AI.

    The SearchSpace will be fertile grounds for a vast for-profit and non-profit ecosystem, with wealth, livelihoods and gig work available to anyone in the world with a Smart-phone.

    And generations from now, it will accompany our species as it expands into the Galaxy.

    And it all starts with trees made from tags.

    Simili is the movement to create the Internet Library for Humanity, Terra Cognos.

    The Internet is the greatest treasure Humankind ever owned, but its knowledge & data are all siloed, raw nuggests buried in mountains of content.

    In order to create this great Library, Simili will create an array of technologies that will generate indexes, tables of contents, categories, hierarchical contexts, tags, data sets and browsing tools to help all humans gain knowledge & wisdom from the web.

    We start with a WordPress plugin, Simili Trees, which implements the first core concepts. Then, we create the Simili SearchSpace that will initially serve WordPress websites that install the plugin. Then, we will build more client-side applications for other websites, big & small.

    The Terra Cognos will consiste of one or more colonies with a hice & swarm based on blockchain and digital currency.

    The Terra Cognos will embrace Data Science, Content Mining, RDF knowlege and AI.

    The SearchSpace will be fertile grounds for a vast for-profit and non-profit ecosystem, with wealth, livelihoods and gig work available to anyone in the world with a Smart-phone.

    And generations from now, it will accompany our species as it expands into the Galaxy.

    And it all starts with trees made from tags.