OM-DTA-002 Data Analyst

Provides the support, administration, and maintenance necessary to ensure effective and efficient information technology (IT) system performance and security.

Develops and administers databases and/or data management systems that allow for the storage, query, protection, and utilization of data.

Examines data from multiple disparate sources with the goal of providing security and privacy insight. Designs and implements custom algorithms, workflow processes, and layouts for complex, enterprise-scale data sets used for modeling, data mining, and research purposes.

Knowledges 32

Code Description Work Roles
K0001 Knowledge of computer networking concepts and protocols, and network security methodologies. 52
K0002 Knowledge of risk management processes (e.g., methods for assessing and mitigating risk). 52
K0003 Knowledge of laws, regulations, policies, and ethics as they relate to cybersecurity and privacy. 52
K0004 Knowledge of cybersecurity and privacy principles. 52
K0005 Knowledge of cyber threats and vulnerabilities. 52
K0006 Knowledge of specific operational impacts of cybersecurity lapses. 52
K0015 Knowledge of computer algorithms. 6
K0016 Knowledge of computer programming principles 3
K0020 Knowledge of data administration and data standardization policies. 2
K0022 Knowledge of data mining and data warehousing principles. 2
K0023 Knowledge of database management systems, query languages, table relationships, and views. 2
K0025 Knowledge of digital rights management. 2
K0031 Knowledge of enterprise messaging systems and associated software. 2
K0051 Knowledge of low-level computer languages (e.g., assembly languages). 4
K0052 Knowledge of mathematics (e.g. logarithms, trigonometry, linear algebra, calculus, statistics, and operational analysis). 6
K0056 Knowledge of network access, identity, and access management (e.g., public key infrastructure, Oauth, OpenID, SAML, SPML). 11
K0060 Knowledge of operating systems. 13
K0065 Knowledge of policy-based and risk adaptive access controls. 5
K0068 Knowledge of programming language structures and logic. 4
K0069 Knowledge of query languages such as SQL (structured query language). 2
K0083 Knowledge of sources, characteristics, and uses of the organization’s data assets. 2
K0095 Knowledge of the capabilities and functionality associated with various technologies for organizing and managing information (e.g., databases, bookmarking engines). 2
K0129 Knowledge of command-line tools (e.g., mkdir, mv, ls, passwd, grep). 1
K0139 Knowledge of interpreted and compiled computer languages. 8
K0140 Knowledge of secure coding techniques. 3
K0193 Knowledge of advanced data remediation security features in databases. 1
K0197 Knowledge of database access application programming interfaces (e.g., Java Database Connectivity [JDBC]). 2
K0229 Knowledge of applications that can log errors, exceptions, and application faults and logging. 1
K0236 Knowledge of how to utilize Hadoop, Java, Python, SQL, Hive, and Pig to explore data. 1
K0238 Knowledge of machine learning theory and principles. 1
K0325 Knowledge of Information Theory (e.g., source coding, channel coding, algorithm complexity theory, and data compression). 6
K0420 Knowledge of database theory. 4

Skills 26

Code Description Work Roles
S0013 Skill in conducting queries and developing algorithms to analyze data structures. 2
S0017 Skill in creating and utilizing mathematical or statistical models. 3
S0202 Skill in data mining techniques (e.g., searching file systems) and analysis. 2
S0028 Skill in developing data dictionaries. 1
S0029 Skill in developing data models. 1
S0037 Skill in generating queries and reports. 2
S0060 Skill in writing code in a currently supported programming language (e.g., Java, C++). 7
S0088 Skill in using binary analysis tools (e.g., Hexedit, command code xxd, hexdump). 3
S0089 Skill in one-way hash functions (e.g., Secure Hash Algorithm [SHA], Message Digest Algorithm [MD5]). 3
S0094 Skill in reading Hexadecimal data. 1
S0095 Skill in identifying common encoding techniques (e.g., Exclusive Disjunction [XOR], American Standard Code for Information Interchange [ASCII], Unicode, Base64, Uuencode, Uniform Resource Locator [URL] encode). 1
S0103 Skill in assessing the predictive power and subsequent generalizability of a model. 1
S0106 Skill in data pre-processing (e.g., imputation, dimensionality reduction, normalization, transformation, extraction, filtering, smoothing). 1
S0109 Skill in identifying hidden patterns or relationships. 1
S0113 Skill in performing format conversions to create a standard representation of the data. 1
S0114 Skill in performing sensitivity analysis. 1
S0118 Skill in developing machine understandable semantic ontologies. 1
S0119 Skill in Regression Analysis (e.g., Hierarchical Stepwise, Generalized Linear Model, Ordinary Least Squares, Tree-Based Methods, Logistic). 1
S0123 Skill in transformation analytics (e.g., aggregation, enrichment, processing). 1
S0125 Skill in using basic descriptive statistics and techniques (e.g., normality, model distribution, scatter plots). 1
S0126 Skill in using data analysis tools (e.g., Excel, STATA SAS, SPSS). 1
S0127 Skill in using data mapping tools. 1
S0129 Skill in using outlier identification and removal techniques. 1
S0130 Skill in writing scripts using R, Python, PIG, HIVE, SQL, etc. 1
S0160 Skill in the use of design modeling (e.g., unified modeling language). 3
S0369 Skill to identify sources, characteristics, and uses of the organization’s data assets. 1

Abilities 5

Code Description Work Roles
A0029 Ability to build complex data structures and high-level programming languages. 1
A0035 Ability to dissect a problem and examine the interrelationships between data that may appear unrelated. 2
A0036 Ability to identify basic common coding flaws at a high level. 2
A0041 Ability to use data visualization tools (e.g., Flare, HighCharts, AmCharts, D3.js, Processing, Google Visualization API, Tableau, Raphael.js). 1
A0066 Ability to accurately and completely source all data used in intelligence, assessment and/or planning products. 12

Tasks 23

Code Description Work Roles
T0342 Analyze data sources to provide actionable recommendations. 1
T0347 Assess the validity of source data and subsequent findings. 1
T0349 Collect metrics and trending data. 1
T0351 Conduct hypothesis testing using statistical processes. 1
T0353 Confer with systems analysts, engineers, programmers, and others to design application. 1
T0361 Develop and facilitate data-gathering methods. 1
T0366 Develop strategic insights from large data sets. 1
T0381 Present technical information to technical and nontechnical audiences. 3
T0382 Present data in creative formats. 2
T0383 Program custom algorithms. 1
T0385 Provide actionable recommendations to critical stakeholders based on data analysis and findings. 1
T0392 Utilize technical documentation or resources to implement a new mathematical, data science, or computer science method. 1
T0402 Effectively allocate storage capacity in the design of data management systems. 1
T0403 Read, interpret, write, modify, and execute simple scripts (e.g., Perl, VBScript) on Windows and UNIX systems (e.g., those that perform tasks such as: parsing large data files, automating manual tasks, and fetching/processing remote data). 2
T0404 Utilize different programming languages to write code, open files, read files, and write output to different files. 1
T0405 Utilize open source language such as R and apply quantitative techniques (e.g., descriptive and inferential statistics, sampling, experimental design, parametric and non-parametric tests of difference, ordinary least squares regression, general line). 1
T0460 Develop and implement data mining and data warehousing programs. 1
T0007 Analyze and define data requirements and specifications. 1
T0008 Analyze and plan for anticipated changes in data capacity requirements. 2
T0068 Develop data standards, policies, and procedures. 1
T0146 Manage the compilation, cataloging, caching, distribution, and retrieval of data. 2
T0195 Provide a managed flow of relevant information (via web-based portals or other means) based on mission requirements. 1
T0210 Provide recommendations on new database technologies and architectures. 2