28 December 2020
25 December 2020
Webvowl
23 December 2020
22 December 2020
21 December 2020
19 December 2020
Relationship Conundrums
Is a woman's place in the kitchen? In many traditionalist societies, a woman is expected to stay in the house after marriage. This is not only odd, it also rules out equality between a man and a woman. At the start, a man expects the woman to stay the same after marriage as she was before it. On the other hand, the woman thinks about changing the man after marriage. This dynamic causes all sorts of issues. The man is unlikely to want to change after marriage. But, invariably, women do change after marriage. The person the man wanted to marry is no longer the person he married. However, an educated woman who decides to take on the role of a housewife after marriage, especially in western society, will feel like she has lost everything in her life. Everything that she aspired to achieve is no longer available to her as an option. Unless, obviously, she aspired to be a housewife, which is another matter altogether. A woman spending eight to ten hours caged in a house while the man is out working is bound to feel bored and frustrated. When the man comes home after long hours at work, he will want to stay at home or do whatever helps him relax. The woman will want to go out. This is likely to lead to friction in the form of nagging, whinging and, most of all, heated arguments. The woman will feel that the man does not respect or understand her. The man will feel the same way towards the woman. If this continues, a bond between the two never forms and they become distant over time. The woman might feel that having children will solve the issue, but that only adds to the man's financial burdens. In fact, it also leads to health issues, as the man is not only working long hours but faces another shift of work when he comes home and has to take his wife out. The woman may even use food as a release for her frustration, leading to back and forth dieting. 
Meanwhile, the husband will spend more time at the workplace to avoid confrontation. Eventually, the woman, with low self-esteem, stops looking after herself and grows less and less attractive to the man, while the man eventually avoids interacting with his wife altogether. If we take this dynamic and reverse it, a completely different story emerges. If both the man and the woman are working, they start their relationship on an equal footing. Both have a lot to talk about at the end of their day. They can give each other space because they both know what it is like to work long hours. Both of them are able to build a sense of respect and understanding towards each other and develop a bond. The woman now has a purpose other than sitting at home. And, they can both find valuable time to share with each other in activities outside of work. The woman no longer feels that marriage is the end of everything. In fact, it becomes the beginning of something good and nurturing. They can share the responsibility for the chores and both contribute financially towards the bills. The man is no longer the only breadwinner, nor the woman the only homemaker. Married life really only works when we remove the assignment of gender roles between two people who can first start out as friends (respect, understanding, support), then partners (love, trust, consideration), then husband and wife (commitment, responsibility, communication). However, in western society, marriage no longer has any real value nor is it necessary. Invariably, infidelity seems to be considered an established norm. But, this rarely provides for a meaningful relationship. If the initial foundations of a relationship are weak, it will hardly lead to much down the line. And, loyalty usually goes amiss if something missing in the relationship was not resolved from the start and brews into further issues down the line. 
In the West, a caring and sympathetic nature towards the other person is rarely considered, as people tend to be brought up with a selfish, individualistic mindset where patience is lacking and minor arguments can turn into major issues. And, counselling only makes things worse because bringing in a third party is not only felt as intrusive, it is also likely that the mediator has relationship issues of their own. Humans are not perfect, nor should society expect us to be. The dynamics of society may bring their own environmental issues into the mix. It is how we deal with the ups and downs, as mature individuals, that determines the foundations we have in a relationship and whether they can withstand the test of time. In fact, the spark that two people had at the start of a relationship will inevitably change over time and turn into something completely different. Perhaps, something enriched that blossoms into something even better. Or, something that withers and dies away in time. As they say, it takes two to tango, and it depends on how far two people are willing to go to make it work. The psychology of relationships is an interesting area for modelling in artificial intelligence and for deciphering the many solutions to dynamic issues. Marriage counselling through artificial intelligence could be a lucrative solution within the confines of privacy. In fact, why must people visit a human, especially as humans are born imperfect, when doing so is not only inconvenient but also uncomfortable for many?
17 December 2020
16 December 2020
15 December 2020
14 December 2020
13 December 2020
12 December 2020
11 December 2020
Should You Use Flink
Flink is currently a very unstable platform. They have re-instituted FlinkML, which is unstable. They have rebalanced the graph option alongside the introduction of the table option. Any stable work now really depends on Spark. The Flink team really need to make up their minds about stream processing and the abstracted features they want to provide in the stack. In fact, the Python option is just riddled with bugs. Perhaps, waiting a while might make the entire platform more stable, but that depends on the goals of the team in the near future. Even the documentation is going slightly pear-shaped. When a core aspect of a platform changes, it is best to fork it into a completely separate project. Instead, the fundamental shift was made in place, which is what has made the Flink platform so unstable and the documentation untrackable. Maybe, in the near future something better will come along to replace Spark and Flink that is ready for commercial use. But, so far it seems Spark is the only real contender in the market, albeit slightly unstable in its own right, providing a sufficient amount of flexibility without the added frustration.
8 December 2020
Lucidea
Mondeca
Wordmap
CoreOn
6 December 2020
4 December 2020
BioMedical Data Sources
- PubMed
- PubMed Central
- BioMed Central
- PubChem
- Medline
- ClinicalTrials
- OBOFoundry
- Galen
- SNOMED
- UMLS
- ON9
- MeSH
- ICD-10
- GO
- BioPortal
- CARO
- DO
- FMA
- HPO
- IDO
- MGED
- MP
- OBI
- OCI
- OGMS
- PATO
- VO
- MedDRA
- RePORTER
- TOXLINE
- BioPAX
- DrugBank
- UniProt
- NCBI
- BioGRID
- CellMap
- ChEBI
- ChEMBL
- DailyMed
- Diseasome
- HapMap
- HomoloGene
- HPRD
- HumanCyc
- HumanPhenotypeOntology
- IMID
- IntAct
- MINT
- NCBI Gene
- NCBI Nature
- PDB
- Pfam
- Pfam-A
- Pfam-B
- Reactome
- RxNorm
- SIDER
- SymptomOntology
Why GCP Sucks
GCP is one of the most unreliable cloud solutions from a so-called reliable provider - Google. In fact, so many of its services reflect badly on the provider's technical ability, and the whole thing operates like a third-rate product initiative. The entire platform is not only rigid but lacks a sufficient variety of service offerings to match the multitude of domain applications. In fact, it cannot be stressed enough that the entire platform runs like an experiment where the services are at best buggy and over-priced. For any architectural decision, it will behoove you to consider that it can most likely be done better on AWS and with a more flexible pricing option. In fact, there is no shortage of engineers and architects in the market who are sufficiently qualified on AWS to help an organization with its ramp-up time. GCP in many respects also lacks sufficient compliance and governance features. And, sudden robotic alerts commonly wreak untimely havoc on otherwise smooth running systems, causing abrupt shutdowns with no advance warning and extremely short three day violation notices. Online support is virtually nonexistent and trying to get hold of someone is extremely difficult - the lack of a human element is not only bizarre, it sets off alarm bells about reliability during critical outage situations. Most of the AI services, especially the natural language related ones, are at best terribly executed and their accuracy is atrocious. Everything on GCP is a bad reflection of what Google is really like internally: arrogant, disorganized, and overly bureaucratic, to the point of forgetting about the customer's needs. In fact, even the monitoring service is terrible. Most of their databases focus on SQL; aren't we in a world where NoSQL is the norm? Google has always been terrible at understanding semantic graph concepts; literally everything they touch turns into a probabilistic problem of approximations. What it lacks in quality and variety of services, it makes up for in devops innovations. 
It may be a good option where applications are still experimental and in the development stage. However, the reliability is so bad that it leaves little reason to build anything on the platform that may one day be for production use. By the time an application is ready to go live, the ongoing frustration with the platform will drive one nuts enough to want to switch to AWS.
27 November 2020
26 November 2020
24 November 2020
Hybrid Methods
Combining semantic rule-based methods with machine learning gives the best hybrid method for optimal contextualized results. A pure machine learning solution is rarely going to understand the semantics of data, and will always return some form of confidence score as an approximation with no exact results. A pure semantic method will provide exact results based on inference and reasoning constraints. A semantic solution is also logically testable through standard programmatic methods. A machine learning solution can at best be evaluated on approximations, for which formal explainability and interpretability methods are required in the process for compliance. Semantics can take the form of ontological representations like knowledge graphs and commonsense reasoning methods. When both approaches are combined there is an increase in connected context and semantics, which is beneficial for formal artificial intelligence driven systems. It also provides for better transfer learning and a way of managing governance. Such methods, when combined, add significant accessibility value to business in the form of natural language interpretability, integration, feature engineering from standards compliance, and human-computer interaction interfaces. In essence, data is transformed into knowledge and information through the enrichment of machine-interpretable context that can grow through the mechanisms of self-learning and self-experience, and inevitably develop self-awareness about the environment. The semantic aspects act as a simulated form of associated memories that are available for forming new data associations and relationships. Such memories are formed through persistence in a knowledge graph that could be treated as an in-memory cache for short-term processing and as storage for long-term processing. In the process, the machine learning and semantic methods feed off each other to increase learnability and comprehension about the domain targets within an open world. 
The entire world wide web is based on the aspects of semantic resources making the internet accessible for browsing, searching, and findability. Metadata is in every technological software and hardware in use today. However, such metadata requires semantic enrichment to enable machine-interpretability which can be achieved through semantic standards. In fact, many programming languages are built using similar theoretical underpinnings of compilers, interpreters, parsers, semantics, and syntax. Many of these methods for decades have shown to be sufficiently plausible in industrial business use cases. In many respects, these are similar aspects, albeit in simpler abstractions, to human intelligence processes that are far more intricate and complex in nature. Rarely, do humans think in statistics for pattern matching and recognition where many of such processes are driven through semantic associations and are reinforced through experience. A pure statistical machine is always going to be fairly unsure about the world if based on approximations. The semantics will give it an edge to formulate meaningful associations from the world through domain relevant experiences which it is then able to interpret and analyze in context.
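The hybrid idea described above can be sketched in a few lines. This is only an illustration under stated assumptions: the tiny knowledge graph, the `has_type` function, and the `Answer` record are hypothetical names invented for the example, and the character-bigram similarity is a crude stand-in for a trained machine learning model. The semantic layer answers exactly through inference over `is_a` links; the statistical layer is only consulted when the entity is unknown, and it returns an approximate confidence score.

```python
from dataclasses import dataclass
from typing import Dict, Set, Tuple

# Toy knowledge graph of (subject, predicate) -> object. Contents are illustrative.
KNOWLEDGE_GRAPH: Dict[Tuple[str, str], str] = {
    ("aspirin", "is_a"): "drug",
    ("drug", "is_a"): "chemical",
}


@dataclass
class Answer:
    holds: bool        # whether the entity is judged to have the wanted type
    confidence: float  # 1.0 for inferred answers, a similarity score otherwise
    source: str        # "semantic" or "statistical"


def infer_types(entity: str) -> Set[str]:
    """Follow is_a links transitively: a very small form of inference."""
    found: Set[str] = set()
    current = entity
    while (current, "is_a") in KNOWLEDGE_GRAPH:
        current = KNOWLEDGE_GRAPH[(current, "is_a")]
        if current in found:  # guard against cycles in the graph
            break
        found.add(current)
    return found


def bigrams(text: str) -> Set[str]:
    return {text[i:i + 2] for i in range(len(text) - 1)}


def similarity(a: str, b: str) -> float:
    """Jaccard overlap of character bigrams (stand-in for a learned model)."""
    x, y = bigrams(a.lower()), bigrams(b.lower())
    return len(x & y) / len(x | y) if (x | y) else 0.0


def has_type(entity: str, wanted: str) -> Answer:
    entity = entity.lower()
    # Semantic path: the entity is known, so the answer comes from inference.
    if (entity, "is_a") in KNOWLEDGE_GRAPH:
        return Answer(wanted in infer_types(entity), 1.0, "semantic")
    # Statistical path: borrow the types of the most similar known entity.
    known = [subject for (subject, _predicate) in KNOWLEDGE_GRAPH]
    best = max(known, key=lambda k: similarity(entity, k))
    return Answer(wanted in infer_types(best), similarity(entity, best), "statistical")
```

Calling `has_type("aspirin", "chemical")` takes the semantic path and is testable like any other program logic, while a misspelling such as `has_type("asprin", "drug")` falls back to the approximate path and carries a score below 1.0 that a caller would need to threshold.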
20 November 2020
Why do you need a PhD
A PhD, to all intents and purposes, adds little value for business. That said, pretentious investors do look for PhD caliber individuals at startups before committing funding. In such cases, the hypocrisy can be seen in the background of such investors, who may even have worked for companies that were founded by college dropouts. Publication of research papers does not generate any revenue for an organization either, other than to gain notoriety. Considering that 80% of all published research work amounts to nothing, that is a lot of investment in wasted time and man hours. By the time a PhD candidate completes their dissertation work, it is already too outdated to be of any significant real use to industry. PhD work can last anywhere between 3 and 6 years; considering market movements and the advancements in technology in industry, graduates would have very little to offer at the time of graduation apart from outdated practical skills. In many cases, an entire support function needs to be developed by an organization to cater for PhD individuals, who will even need help in refactoring and scaling their work. In most cases, they will need a lot of mentoring and training in all the basic skills that they should have learned in academia and that a practically minded individual with work experience would already have to offer an organization. It is questionable what quantifiable work they produce if additional resources are needed to make it of any use to business. Invariably, theory does not supersede practice. In many cases, theory may not be plausible to implement in practice due to uncertainty and complexity, of which many PhD individuals have very little experience outside of academia. And, putting them in a position of influence over business product initiatives is a big risk as they come with very little practical experience. Academia is very different from how things get done in the practical world. Who really is defined as a domain expert? 
Is it one who has studied a topic for decades with published papers in a sheltered environment, or one who has learned the art through delivering practical projects across industry domains? In fact, as more clueless PhD people find work in industry, this has driven a shortage in academic institutions. This is also an indication of how bad teaching is in academia and how out of touch it is with the complexities of the real world. Considering that only one percent of the world's educated population holds a PhD, it is hardly a wonder that they won't be missed much in industry. Perhaps, people with PhDs should really stay in academia where they can teach (or, lacking that, improve their teaching skills) and publish papers (or, lacking that, improve the quality of their research) within the confines of a protected institution, and leave the practical aspects to the experienced experts in business.
18 November 2020
15 November 2020
13 November 2020
Why do you require a Data Engineer
Companies need to fundamentally ask the question - why do you require a data engineer on your team? The role of a data engineer spans 80% of the data science method. If a data engineer is expected to work alongside data scientists then there is a valid assumption one can make about the incompetence of such data scientists especially as they are only able to fulfil 20% of their job responsibilities as per the data science method. The next follow up question naturally arises - why do you require a data engineer on your team if you already have a team of data scientists? A competent data scientist's job is to be responsible for the entire end-to-end data science method which doesn't mean just building the models but also to pipeline and scale them so they are of use to the company. In fact, they also are responsible for doing their own feature engineering. Otherwise, this begs the question - how can one build a model without doing the feature engineering themselves? And, it further begs the question, that if feature engineering work is done by someone else other than the one building the model, it is highly likely that it would lead to an overfitted model solution that only partially solves the business case. This means there is no real need to hire a data engineer if there is already a team of data scientists. In fact, companies can significantly reduce their hiring costs by just hiring people that know and understand the data science method. Either only have a team of data engineers or only a team of data scientists as it makes literally no sense of hiring both in a team. The next follow-up question could be - if you already have data scientists on your team, then is there any point in hiring a machine learning engineer? With so many role overlaps it only spells more and more incompetence and clueless team members not to mention frustration in hiring so many people. 
Companies can save on costs significantly by hiring capable people that have the sense to understand their job functions and have the relevant practical skills. Furthermore, one can then proceed to ask the next question - why do you require a data engineer, a data scientist, and even a machine learning engineer, if you could just hire a team of AI engineers? Machine learning is part of AI, data engineering is part of data science function, data science is pretty much what AI engineers can also do. And, to be able to act as AI engineers you have to understand aspects of computer science. Which leads to the last and final question - ultimately don't they all require a competent software engineer with practical skills that can deliver solutions for business? Invariably, what we find in industry is that the weakest link is the data scientist, and this is usually because companies are tarred with recruiting Phds who have no practical skills apart from their academic theory that adds little quantifiable value for business in delivering products, they neither come with software engineering skills nor do they have the applied experience mindset to appreciate it. Precisely, why so many data scientists in industry are only interested in building models, because in academics, they are rarely taught the importance of feature engineering, of how to build a scalable pipeline, nor about the entire data science method. Essentially, the data engineer role becomes a gap filler for all the inadequacies and deficiencies of an incompetent data scientist resource.
3 November 2020
2 November 2020
Are Men and Women Equal
Mathematically, two things are equal when they are exactly the same and when they represent the same object. Equality is symmetric, transitive, and reflexive. For men and women to be equal, they would have to be identical in their attributes. Biologically, men and women are distinctly different. Men cannot give birth. Men cannot get periods. Physically, men and women may look similar to some degree. However, mentally and hormonally they may also differ. Essentially, they have different attributes that define their aggregate makeup. Generally, the laws of most countries, on paper, only recognize two genders, male and female, so we can reduce the complexity here from infinite genders. Since a male has XY chromosomes and a female has XX chromosomes, for female to equal male, X would have to equal Y. The laws of equality dictate that if x = y, then y = x. That would invalidate the existence of two sexes. For a woman to exist or continue to exist, one would need XY for reproduction and variety, assuming the first woman born cannot be born pregnant. In order for the population to grow, there is a need for the XY chromosome pair. Although, this is quite an oversimplification given the various mutations that can occur. Let's assume X = 1 and Y = 2. In which case, through addition, X+X = 2 and X+Y = 3, and 2 is not equal to 3. Let's assume the second scenario of X = Y, Y = X. In which case X+X = 2 and X+Y (where Y can be replaced by X) = 2, but this is not possible because that would mean both are women under the laws of equality, which would refute the claim of there being two sexes, a man and a woman. The alternative also does not hold: when Y is substituted for X, both would be men, which cannot be. But, since men and women do exist, equality cannot hold. This is precisely why a woman is called a woman and a man is called a man, because the two are dissimilar. In fact, one can substitute multiplication for addition and arrive at the same answer. 
Which would imply that Y cannot equal to X, therefore XX cannot equal to XY. Invariably, in society we still find women expecting men to pay after them and even to the point of bending the knee for a proposal. Rarely, do we see a woman bending the knee to propose to a man. Women often take pride in the fact that they can multitask, while at the same time making the implication that men cannot. By making such sweeping statements, perhaps, they don't realize they put themselves in a self-defeating corner of any justifiable evidence for equality. Also, stating that biological males should not be competing against biological females automatically dismisses any argument in support for equality. Separate public bathrooms for men and women also disproves the notion of equality. Laws in most societies are there to protect women in terms of abortions, custody battles, alimony, and child support. When it comes to abortions women talk about "my body, my choice" and ignore the rights of the father. However, when the child is born, suddenly the rights of the father become apparent to the woman when she demands child support. Such expectations clearly do not define equality. However, what they do display is hypocritical double standards. Hence, they may have different needs but similar wants which is what society defines as equality. An obvious case can be found in their differential needs in a typical supermarket where women personal care aisle is separate from the men personal care aisle. If they were equal they would both share essentially the same needs in personal care products. That would imply that emotions aside, logically men and women are two distinct types of humans that cannot be equal to each other. In fact, one can go further, and make the assumption under the laws of equality for the cases of both men and women that they are essentially defined as unique individuals with uniquely defined attributes that measure the summation of their identities. 
No two men are ever alike. Just like no two women are ever alike. We are all unique in our own way. In fact, even identical twins eventually develop variants. In multi-faceted societies, gender roles often are affected by environment factors that go beyond economic, social, and cultural divides that influence the makeup of individual personalities and identities.
1 November 2020
Linear Annotations
24 October 2020
Ethicists Are Unethical
Ethicists are not very ethical individuals. In fact, they are in a profession of knowing what is right and wrong. Perhaps, it is this confidence at knowing what is right and wrong that makes them less likely to act ethically in real life. In many respects, an ethicist is the most likely hypocrite because even after knowing something is wrong they are willing to commit the act. They are often seen preaching for the right things, but hardly applying any of it in practice in their own life. Furthermore, AI ethics is doomed if a human ethicist is relied on to develop the guiding principles as they are naturally unqualified. It is this feeling of entitlement that makes one consider doing something bad. They are also most likely to suffer from the god complex. Ethical and moral judgement is clouded in human nature. Do we really need to implement and replicate this flawed human nature in AI? In many cases, regular introspection and self-reflection are fundamentally important aspects of ethics and morality which may need to be extended into the generalizable AI machine.
22 October 2020
19 October 2020
Fraser
Fred
10 October 2020
9 October 2020
Is There Really A Skills Shortage
In most cases, in industry, a skills shortage, invariably does not exist. There is always a case for more supply of skills than there is demand for them. Hence, why in many countries there is always a percentage of unemployment. People are willing and able to work. However, the problem stems from the fact that employers filter out perfectly good CVs. This may be a result of their own biases, their sense of likability, as a result of keyword hunting, their need to want more skills for substantially less pay, or the fact that they don't care to read the full candidate profile. In many organizations the first people that get involved in filtering CVs are non-technical individuals who have no understanding of the skills nor the context of how to use them. By the time the hiring manager receives the profiles they have already been whittled down through recruitment bias. While through the interview selection process even further bias is applied and in turn the person they decide to recruit may not necessarily even be the best candidate in the pile of CV applications. For many roles, a recruiter may receive anywhere from one to hundreds of CVs of which many are likely to be suitable for candidacy. The recruitment process is not very fair for candidates as it is a one-sided process to favor employers. There is relatively little respect or consideration for candidates during the application or through a conscious feedback process from the employer. In many cases, GDPR processes at organizations may not even be fully compliant nor provide transparency in regards to how the personal details have been processed and stored of candidates. Recruiters invariably may pass on CVs to managers who may then pass on CVs to other members of the team, all the while such personal details are being stored on multiple email accounts and may even get printed out in hardcopy. One may even notice the reckless use of candidate CVs as scratch paper. 
In some cases, consent to pass on details may be provided to recruiter but for whatever reason the recruiter may not decide to represent the candidate, during this time the candidate may not have any feedback of where and how their personal details have been processed, stored, or passed on. There needs to be more done to protect the rights of individuals and their personal details both when they are applying as candidates and when they transition into employees. Managers and recruiters seem to forget that the very candidate they mistreat, disrespect, or are inconsiderate towards during an application process could one day become the founder of a company that they may want to work with in the future. An individual deserves just as much respect when they are a candidate, when they are an employee, and as an employer. Can AI really be a solution towards solving many of the above issues created by humans? Perhaps, only if, managerial politics and biases in organizations can be removed from the equation.
2 October 2020
1 October 2020
30 September 2020
29 September 2020
13 September 2020
Lime
emrQA
9 September 2020
8 September 2020
1 September 2020
26 August 2020
23 August 2020
22 August 2020
20 August 2020
14 August 2020
11 August 2020
8 August 2020
31 July 2020
30 July 2020
27 July 2020
24 July 2020
20 July 2020
Rust Is Not Yet A Better Language
- Questionable and at times dodgy Rust arithmetic
- Functional calls touch memory twice in Rust
- Rust is not faster than C++
- Unproven safety mechanism in Rust
- Painful rewrite of C library headers
- Compilation times are slow
- Rust is a pain to work with: the compiler lacks transparency and is inconsistent, with some things documented rather than properly checked
- Integration with other languages is difficult
- Rust has a bigger assembly code footprint than C++
- Unsafe blocks are not checked
- Most enterprise and technology products are built on C/C++/Java interfaces, so a complete rewrite might be required for Rust
- Rust doesn't play nice with other languages
- Rust ecosystem tools are insufficient for prime time use
- Tedious and verbose
- No formal community specifications and release process
18 July 2020
L4-L7 Network Services Definition
16 July 2020
15 July 2020
Optimization
- Don't bother optimizing a solution in prototype mode, focus on solving the problem (what if the solution is incorrect, one might be wasting time optimizing an incorrect solution)
- Focus on testing the implementation
- Once the implementation is correct, refactor it - focus on high cohesion, loose coupling
- Keep the implementation loosely coupled from third-party libraries (also, don't get hung up on such things as whether the third-party library uses a C implementation for optimization; it is more important at this stage to make sure the algorithmic implementation is correct)
- Treat third-party libraries as dependencies
- Use profilers and correct metrics to check for performance
- If there is a bottleneck, identify whether it is at application-level or system-level
- Only optimize as an afterthought and when a bottleneck is identified (How can one optimize for something without knowing where the bottleneck is? In some cases, through experience, one might know earlier in the implementation where a bottleneck is likely to occur, in which case eager optimization is justified by that experience and may in fact save time later in the process)
- Don't optimize for the sake of optimizing, it may be unnecessary, especially in the cloud
- When a dependency is the bottleneck, replace it with another, more performant dependency or create your own
- Swapping dependencies should not affect the algorithmic implementation (as long as the algorithmic implementation to use is also correct in the third-party dependency), which is the whole point of using functions as blackboxes that provide parameter passing. Create a wrapper if needed. Use appropriate best practices and patterns.
- Is the bottleneck at application-level negligible enough to be offloaded to system-level cloud infrastructure?
- Don't eagerly optimize early and often - premature optimization only leads to more issues and complexity
- Only optimize when there is a logical need to do so (be pragmatic)
- With increasing level of experience, one can deduce when, where, and how to optimize for the outcome of results
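A minimal sketch of two of the points above: keep the algorithm behind a thin wrapper so a dependency can be swapped, and measure with a profiler before deciding to optimize. Everything here is illustrative. `builtin_sort` is a hypothetical stand-in for a third-party dependency, `insertion_sort` is a deliberately slower replacement with the same contract, and `profile_call` is a helper name invented for the example; only `cProfile` and `pstats` come from the Python standard library.

```python
import cProfile
import io
import pstats
from typing import Callable, Iterable, List


def builtin_sort(values: Iterable[int]) -> List[int]:
    """Stand-in for a third-party dependency, treated as a black box."""
    return sorted(values)


def insertion_sort(values: Iterable[int]) -> List[int]:
    """A slower alternative that honours the same contract."""
    result: List[int] = []
    for value in values:
        index = len(result)
        # Walk left until the correct insertion point is found.
        while index > 0 and result[index - 1] > value:
            index -= 1
        result.insert(index, value)
    return result


def median(
    values: Iterable[int],
    sort_fn: Callable[[Iterable[int]], List[int]] = builtin_sort,
) -> float:
    """The algorithm depends only on the sort contract, not on a specific library."""
    ordered = sort_fn(values)
    count = len(ordered)
    if count == 0:
        raise ValueError("median of empty input")
    middle = count // 2
    if count % 2:
        return float(ordered[middle])
    return (ordered[middle - 1] + ordered[middle]) / 2


def profile_call(fn: Callable, *args, **kwargs) -> str:
    """Run fn under cProfile and return the top entries by cumulative time."""
    profiler = cProfile.Profile()
    profiler.enable()
    fn(*args, **kwargs)
    profiler.disable()
    buffer = io.StringIO()
    pstats.Stats(profiler, stream=buffer).sort_stats("cumulative").print_stats(5)
    return buffer.getvalue()
```

Swapping `sort_fn` changes the dependency without touching `median`, so correctness can be tested once and performance compared afterwards, for example with `print(profile_call(median, list(range(2000, 0, -1)), insertion_sort))`.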
14 July 2020
kops
8 July 2020
6 July 2020
3 July 2020
1 July 2020
Types of Narratives
- Empirical Narratives
- Fictional Narratives
Narratology
- Cognitive Narratology
- Contextualist Narratology
Discourse Modes
- Narrative
- Argumentative
- Expository
- Descriptive
- Instructive
29 June 2020
Useless Managers
Loss of Creativity in Academics
28 June 2020
22 June 2020
20 June 2020
TensorFlow Tools
- ML Metadata
- Data Visualization
- Serving
- TensorFlow.js
- Transform
- Model Analysis (TFX Model Pusher and TFX Model Validator)
- Lite
- Privacy
- Federated
- CoLab
- Probability
- Graphics
- Agents
- Ranking
- Quantum
- Magenta
- TensorRT
- Tensor2Tensor
- Tensorboard
- Extended
- Sonnet
- Dopamine
- Lattice
- Model Optimization
- Hub
- RaggedTensors
- Mesh
- I/O
- TRFL
- Unicode Ops
Incompetent Graduates
17 June 2020
14 June 2020
13 June 2020
Argument Mining Corpora
- IAC
- ABCD
- AWTP
- ComArg
- Technical Blogs
- Web Discourse
- Araucaria
- Argumentative Microtext Corpus
- News Editorials
- Persuasive Essay Corpus
AIFDB
10 June 2020
Applications of Metaphor Processing
- Creative Writing
- Joke Generators
- Figurative Information Retrieval
- Narrative Generators
- Sentiment Recognition
- Persuasive Marketing
- Commonsense Reasoning
- Political Communication
- Discourse Analysis
- Reading Comprehension
- Review Generators
- Poetry Generators
- Lyrics Generators
- Slogan Generators
8 June 2020
3 June 2020
2 June 2020
1 June 2020
30 May 2020
SimFin
24 May 2020
Magenta
23 May 2020
22 May 2020
20 May 2020
GNN Datasets and Implementations
- PubMed
- Cora
- Citeseer
- DBLP
Biochemical:
- MUTAG
- NCI-1
- PPI
- D&D
- PROTEIN
Social Networks:
- BlogCatalog
Knowledge Graphs:
- FB13
- FB15K
- FB15K237
- WN11
- WN18
- WN18RR
Repos:
- Network Repository
- Graph Kernel Datasets
- Relational Dataset Repository
- Stanford Large Network Dataset Collection
- Open Graph Benchmark
Implementations:
GNN Models:
- GGNN
- Neural FPs
- ChebNet
- DNGR
- SDNE
- GAE
- DRNE
- Structured RNN
- DCNN
- GCN
- CayleyNet
- GraphSage
- GAT
- CLN
- ECC
- MPNN
- MoNet
- JK-Net
- SSE
- LGCN
- FastGCN
- DiffPool
- GraphRNN
- MolGAN
- NetGAN
- DCRNN
- ST-GCN
- RGCN
- AS-GCN
- DGCN
- GaAN
- DGI
- GraphWaveNet
- HAN
Deep Fact Checking
12 May 2020
Text Production Datasets
- WikiBio
- WikiNLG
- SBNation
- RotoWire
- SR'18
- E2E
- Summarization (DUC2001-2005)
- CNN
- DailyMail
- NYTimes
- NewsRoom
- XSum
- Simplification
- PWKP
- WikiLarge
- Newsela
- Compression
- Gigaword
- Automatic Creation of Extractive Sentence/Compression
- MASC
- Multi-Reference Corpus for Abstractive Compression
- Cohn and Lapata's Corpus
- Paraphrasing
- MSRP
- PIT-2015
- Twitter News URL Corpus
- ParaNMT-80
- ParaNMT-50
- MTC
- PPDB
7 May 2020
6 May 2020
5 May 2020
Moogsoft
4 May 2020
Social Mixed Reality
3 May 2020
22 April 2020
20 April 2020
17 April 2020
When is a University Degree Pointless?
- When you need mentors in the workplace to teach you how to do everything?
- When you need help using Google to search for information?
- When your only way of learning anything new is by asking or expecting others to show you how it is done?
- When you can't seem to understand anything and often use phrases like 'I don't understand' in the workplace?
- When you have no applied skills, even where they are intuitive or require only basic common sense?
- When you can learn more by doing rather than by sitting in a classroom?
- When your entire objective of learning is to pass an exam and/or course?
- When you spend your time sharing academic theory but have literally no practical awareness of how to apply any of it?
- When you can understand everything by just picking up a book or an online tutorial then applying it yourself?
- When you understand the advanced theoretical concepts but have no clue about the basic mechanics of it?
- When your lack of academic integrity extends into your bad work ethics and behavior?
- When you carry an air of arrogance about having achieved a degree and look down on people with many years of practical experience in the workplace?
- When you have little respect and consideration for others in the workplace?
- When you are unable to meet and agree on sensible timelines?
- When you struggle to micro-manage and organize your own work habits?
- When you sit there in the workplace playing politics and blame games with everyone?
- When you are not proactive and sufficiently resourceful in the workplace?
- When you struggle to make decisions and reason through things especially when things go wrong?
- When you need help with everything including things that are a no-brainer?
- When you outright dismiss new ways of doing things, as part of your narrow-mindedness, without keeping an open-mind for critical exploration and assessment?
- When you don't give credit where credit is due?
- When you exhibit discrimination, racism, and biases in the way you treat others in the workplace?
- When you expect the opposite gender to clean up after you and treat them as less equal to you?
- When the manifestations of your mannerisms and attitudes display a lack of professionalism?
- When you feel the need to argue about everything even things that don't require a discussion?
- When you find it difficult to adapt to change?
- When you prefer to work in a routine?
- When you don't learn from your mistakes?
- When your degree is unrecognized and unaccredited as a valid degree?
- When you find yourself questioning the value of your degree and the time spent towards achieving it?
- When you have sufficient practical experience that no one bothers to even care what university you went to and what degree you earned?
- When after earning a degree you are still doing a menial job that didn't even require a qualification?
- When the majority of your learning happens in the school of life rather than in a classroom?
- When you are still clueless about what you want to do with life and how getting a degree will make that happen?
- When you look back and you realize you should have done a degree in another subject instead?
- When you are still a reckless and unproductive mess in society?
- When you realize your intention of attaining a degree was to please your parents rather than being aligned with your interests and talents?
- When you use Wikipedia references as a source of your knowledge?
- When you rely on conspiracy theories, gossip, and rumors as a way to justify your claims?
- When you only know how to regurgitate things?
- When you can't see any sense in applying best practices?
- When you discredit others based on their backgrounds rather than objectively evaluating the quality of their work?
- When you try to take credit for other people's work?
- When the name of an institution you attended is more important and significant in value to you in comparison to the content of the course?
- When you disrupt and undermine other people's work while requiring help to do your own work?
- When you unethically use, exploit, and walk over other people to help advance your own career?
- When you had to bribe your way into earning a university degree?
- When you had to pay someone on a crowdsourcing site to do the entire coursework or dissertation for you?
- When you cheat your way through an exam or a coursework?
- When your degree course has literally no coverage on applied ethics in broader and narrower terms whether as part of university-wide or course-specific initiative?
- When your investor only cares about what university you went to rather than the potential of your proposed product?
- When your employer or interviewer cares more about what university you went to rather than your skills and experience?
- When you try to spend time optimizing a solution even before you have fully understood the business case and solved the problem in a prototype?
- When you are unwilling to challenge the norm and status quo, especially when it is incorrect or not the better way of doing things?
- When you can't be bothered to read the documentation?
9 April 2020
8 April 2020
6 April 2020
FoodKG
5 April 2020
Uvicorn
Daphne
4 April 2020
Knowledge Graph Embeddings
- analogy
- box
- capse
- care
- complex
- conmask
- conve
- convkb
- crosse
- ctransr
- dihedral
- distmult
- dkrl
- ebemkg
- eakgae
- ext-rescal
- kgcompletion
- gake
- hole
- ikrl
- inferbeddings
- kale
- kbgan
- kbbert
- kblrn
- kdcoe
- kg2e
- kg2vec
- kgelda
- kglove
- krear
- lfm
- literale
- manifolde
- mfold
- mkbe
- mlp
- mtkgnn
- ntn
- neurallp
- proje
- proppr
- ptranse
- rdf2vec
- resource2vec
- rescal
- rgcn
- rotate
- rsn
- rtranse
- se
- simple
- sme
- ssp
- stranse
- tatec
- teke
- tkrl
- toruse
- transa
- transd
- transe
- transea
- transf
- transg
- transh
- transm
- transparse
- transr
- transt
- trescal
- tucker
- um
- wiki2vec
2 April 2020
1 April 2020
31 March 2020
CSVW
CSVW Namespace Vocabulary Terms
Generating RDF from Tabular Data on Web
Embedding Tabular Data in HTML
CSV on Web Use Cases
Generating JSON from Tabular Data on Web
Metadata Vocabulary for Tabular Data
Model for Tabular Data and Metadata on Web
CSVLint
30 March 2020
26 March 2020
25 March 2020
Fake Data Scientists
- they have no clue about the processes of a data science method
- they skip the feature engineering part of the data science method
- they require data engineers to provide them cleaned data through an ETL process
- they need a whole team of technical people to support their work
- they are only interested in building models and the models they build inherently are almost always overfitted as they never bother to do the feature engineering work themselves
- they don't consider creating their own corpus as an important step of model build work
- they don't understand the value of features when training a model to solve a business case
- they have no clue how to build, evaluate, scale, and deploy their models to production
- they think with a phd they know everything but practically they are zero
- they rarely bother to understand the business case or ask the right questions
- they don't know how to augment the data to create their own corpus for training
- they don't know how to apply feature selection
- they don't know how to generalize a model, so they sit there re-tuning their overfitted models
- they spend years and years sitting in organizations building overfitted models when they could have built generalizable models in weeks and months
- they don't understand the value of metadata or the value of knowledge graphs for feature engineering
- they raise ridiculously dumb issues during agile standups, like having built a model that lacks certain features (i.e. they skipped the feature engineering step)
- they build a model straight out of a research paper and assume the exploratory step is the entire data science method
- they use classification approaches when they should be using clustering methods
- they are unwilling to learn new ways of doing things or to adapt to change
- they prefer to use notebooks rather than build a full structured implementation of their models that can be deployed to production
- they build models that contain no formal evaluation or testing metrics
- they only partially solve a business case because they skipped the feature engineering or passed that effort to a data engineer
- they are only interested in quantitative methods and not willing to think outside the box of what they have been taught in academics
- they build academic models that are not fit for purpose for production nor do they add business value
- they require a lot of handholding and mentoring to be taught basic coding skills
- they struggle to understand research papers, or the fact that 80% of such research work is useless and of no inherent value
- they literally assume that something is state of the art when it is mentioned in a research paper rather than contextualize the model appropriateness to solve a business case
- they don't bother to visualize the data as part of exploration stage
- they don't bother to do background research to identify use cases where a certain approach has worked or not worked for a business
- they don't bother to look at reuse appropriately
- they have no understanding of how to clean data
- they try every model type until something sticks
- they don't have clarity on how the different model types work
- they don't fully understand the appropriate context of when to apply a model type
- they only know very few model methods and how to approach them for a narrow set of business cases
- they don't understand bias and variance
- they don't know whether they want accuracy or interpretability, or how to pick between them
- they don't know what a baseline is
- they use the wrong sets of metrics
- they incorrectly apply the train, validation, test split
- they go to the other extreme of focusing on optimization before actually solving the problem
- they have a phd and the arrogance to match, but literally no practical experience of how to be productive in applying any of it in the workplace especially against noisy unstructured data
- they come with fancy phds and spend time teaching others how to do their job, but usually require the help of everyone on the team to do their own work
- they come with a phd in a specific area but have no willingness to understand other scientific disciplines in the application of data or have a tendency to outright dismiss such methods
- they think AI is just machine learning
- they want someone to hand them a clean dataset on a gold platter because they can't be bothered to do it themselves nor do they think it is an important aspect of their work
- they can't seem to think beyond statistics to solve a problem
- they have a tendency to look down on people and dismiss anyone who doesn't hold a phd
- they struggle to understand basic concepts in computer science
- they need a separate resource to help them refactor their code because they won't bother to do it themselves
- they find that services like datarobot help their work by automating machine learning, especially feature engineering, which inherently allows them to build overfitted models much faster
- they can't tell the difference between structured and unstructured data
- they don't have a clue how to deal with noisy data
- they are not very resourceful in hunting for datasets as part of a curation step
- they need to be shown how to google for things and basically need someone constantly showing them how to do things to be practical in the workplace
- they prefer to use GUI interfaces that allow them to simply use buttons and drag/drop to build models rather than hand build it themselves
- they state that they have been a data scientist for the last 20 years when the field only went mainstream in industry in the last 4 or 5 years (one indication is when the designated role first started being advertised on recruitment boards and within organizations)
- they want to apply machine learning to everything, even where it may be overkill
- they hold a phd but are more than happy to plagiarize other people's work and try to take credit for it; in many cases their bit is probably just exposing it as an API
- they hold a phd but try to take credit for the entire work, even when someone else or an entire team has probably done 80% of it
- they use personal pronouns like 'I' in most cases, but rarely do they use 'We' when working in the team
- they only care about their inputs, outputs, and dependencies for building a model rather than being flexible, considerate, and thinking as a team in looking at the bigger picture
- if your 'head of data science' uses terms like 'I don't understand' to the point of annoyance, then it is a likely indication of their technical incompetence in that capacity
- they think decision trees are just a bunch of rules and not a type of machine learning technique
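The points above about baselines and the train, validation, test split can be illustrated with a small sketch. It uses only the standard library and synthetic data; the 60/20/20 ratio, the toy threshold "model", and all function names are assumptions made for the example, not a recommended recipe.

```python
import random
from collections import Counter


def split_dataset(rows, train=0.6, validation=0.2, seed=42):
    """Shuffle once, then cut into disjoint train/validation/test parts."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train)
    n_val = int(len(shuffled) * validation)
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_val],
        shuffled[n_train + n_val:],
    )


def accuracy(predict, rows):
    """Fraction of rows where the prediction matches the label."""
    return sum(predict(x) == y for x, y in rows) / len(rows)


def majority_baseline(train_rows):
    """Baseline: always predict the most common training label."""
    label = Counter(y for _, y in train_rows).most_common(1)[0][0]
    return lambda x: label


def fit_threshold(train_rows):
    """Toy 'model': pick the cut-off that scores best on the training part."""
    candidates = sorted({x for x, _ in train_rows})
    best = max(candidates,
               key=lambda t: accuracy(lambda x: int(x > t), train_rows))
    return lambda x: int(x > best)


# Synthetic data (hypothetical): one feature, label is 1 when it exceeds 70.
rng = random.Random(0)
rows = [(x, int(x > 70)) for x in (rng.randint(0, 100) for _ in range(500))]

train_rows, val_rows, test_rows = split_dataset(rows)
baseline = majority_baseline(train_rows)   # fitted on train only
model = fit_threshold(train_rows)          # fitted on train only

print("baseline on validation:", accuracy(baseline, val_rows))
print("model on validation:   ", accuracy(model, val_rows))
print("model on test:         ", accuracy(model, test_rows))
```

The validation part is for comparing candidates against the baseline; the test part is touched once at the end, so the reported figure is not tuned against.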
24 March 2020
JATS-XML
23 March 2020
20 March 2020
Metadata Validators
Google Structured Data Testing Tool
Yandex Structured Data Testing Tool
Structured Data Linter
RDF to SVG Bookmarklet
Bing Markup Validator
Apple App Search API Validation Tool
Open Graph Debugger
RDFa Play
Microdata Parser
Google Data Feed Validation Tool
JSON-LD Playground
Schema-DTS
Email Markup Tester
OpenLink Structured Data Sniffer
Microdata.reveal
Microdata/JSON-LD sniffer
Semantic Inspector
META SEO Inspector
Green Turtle RDFa
RDF Translator
Convert RDFa to JSON-LD
Convert Wikipedia URL to DBPedia URL
Wikidata Lookup by Name
Sindice Web Data Inspector
Corporate Contacts Markup Tester
Event Markup Tester
17 March 2020
9 March 2020
Informal To Formal Ontology Terminology
- building a model of some subject matter -> building an ontology
- things -> individuals
- kinds of things -> classes
- generalizations/specializations -> subClassOf
- some kind of thing -> instance or member of class
- literal is of a certain kind -> has a datatype
- relationships between things -> object properties
- attributes of things -> data properties
- kinds of literals -> datatypes
- saying something -> asserting a triple
- drawing conclusions -> inference
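As a rough illustration of the mapping above, the sketch below asserts a few triples in plain Python and draws one simple kind of conclusion from them (class membership implied by subClassOf). The class, individual, and property names (Dog, rex, hasOwner, hasAge) are made up for the example, and the inference shown is only a small subset of what an ontology reasoner does.

```python
# Saying something = asserting a (subject, predicate, object) triple.
triples = {
    ("Dog", "rdfs:subClassOf", "Mammal"),     # generalization/specialization
    ("Mammal", "rdfs:subClassOf", "Animal"),
    ("rex", "rdf:type", "Dog"),               # individual is a member of a class
    ("rex", "hasOwner", "alice"),             # object property: thing to thing
    ("rex", "hasAge", ("3", "xsd:integer")),  # data property: thing to typed literal
}


def superclasses(cls, triples):
    """All classes reachable from cls by following rdfs:subClassOf."""
    found, frontier = set(), [cls]
    while frontier:
        current = frontier.pop()
        for s, p, o in triples:
            if s == current and p == "rdfs:subClassOf" and o not in found:
                found.add(o)
                frontier.append(o)
    return found


def infer_types(individual, triples):
    """Asserted classes plus those implied by the subclass hierarchy."""
    asserted = {o for s, p, o in triples
                if s == individual and p == "rdf:type"}
    inferred = set(asserted)
    for cls in asserted:
        inferred |= superclasses(cls, triples)
    return inferred


print(sorted(infer_types("rex", triples)))  # ['Animal', 'Dog', 'Mammal']
```

Only ("rex", "rdf:type", "Dog") was asserted; membership of Mammal and Animal is the drawn conclusion.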
8 March 2020
Bartoc
Industry-Specific Taxonomies and Ontologies
- Universal Decimal Classification Summary
- School Online Thesaurus
- Getty Art & Architecture Thesaurus
- Unesco Thesaurus
- GeoNames
- Springer Nature SciGraph
- UKAT Archival Thesaurus
Financial:
Energy & Environment:
Ecommerce & SEO:
Pharma & Healthcare:
Business:
- ACM Computing Classification
- Information Technology Glossary
- ITIL
- UDC
- Springer Nature SciGraph
- GESIS
- BioPortal