OM-DTA-002 Data Analyst
Provides the support, administration, and maintenance necessary to ensure effective and efficient information technology (IT) system performance and security.
Develops and administers databases and/or data management systems that allow for the storage, query, protection, and utilization of data.
Examines data from multiple disparate sources with the goal of providing security and privacy insight. Designs and implements custom algorithms, workflow processes, and layouts for complex, enterprise-scale data sets used for modeling, data mining, and research purposes.
Knowledges 32
Code | Description | Work Roles |
---|---|---|
K0001 | Knowledge of computer networking concepts and protocols, and network security methodologies. | 52 |
K0002 | Knowledge of risk management processes (e.g., methods for assessing and mitigating risk). | 52 |
K0003 | Knowledge of laws, regulations, policies, and ethics as they relate to cybersecurity and privacy. | 52 |
K0004 | Knowledge of cybersecurity and privacy principles. | 52 |
K0005 | Knowledge of cyber threats and vulnerabilities. | 52 |
K0006 | Knowledge of specific operational impacts of cybersecurity lapses. | 52 |
K0015 | Knowledge of computer algorithms. | 6 |
K0016 | Knowledge of computer programming principles | 3 |
K0020 | Knowledge of data administration and data standardization policies. | 2 |
K0022 | Knowledge of data mining and data warehousing principles. | 2 |
K0023 | Knowledge of database management systems, query languages, table relationships, and views. | 2 |
K0025 | Knowledge of digital rights management. | 2 |
K0031 | Knowledge of enterprise messaging systems and associated software. | 2 |
K0051 | Knowledge of low-level computer languages (e.g., assembly languages). | 4 |
K0052 | Knowledge of mathematics (e.g. logarithms, trigonometry, linear algebra, calculus, statistics, and operational analysis). | 6 |
K0056 | Knowledge of network access, identity, and access management (e.g., public key infrastructure, Oauth, OpenID, SAML, SPML). | 11 |
K0060 | Knowledge of operating systems. | 13 |
K0065 | Knowledge of policy-based and risk adaptive access controls. | 5 |
K0068 | Knowledge of programming language structures and logic. | 4 |
K0069 | Knowledge of query languages such as SQL (structured query language). | 2 |
K0083 | Knowledge of sources, characteristics, and uses of the organization’s data assets. | 2 |
K0095 | Knowledge of the capabilities and functionality associated with various technologies for organizing and managing information (e.g., databases, bookmarking engines). | 2 |
K0129 | Knowledge of command-line tools (e.g., mkdir, mv, ls, passwd, grep). | 1 |
K0139 | Knowledge of interpreted and compiled computer languages. | 8 |
K0140 | Knowledge of secure coding techniques. | 3 |
K0193 | Knowledge of advanced data remediation security features in databases. | 1 |
K0197 | Knowledge of database access application programming interfaces (e.g., Java Database Connectivity [JDBC]). | 2 |
K0229 | Knowledge of applications that can log errors, exceptions, and application faults and logging. | 1 |
K0236 | Knowledge of how to utilize Hadoop, Java, Python, SQL, Hive, and Pig to explore data. | 1 |
K0238 | Knowledge of machine learning theory and principles. | 1 |
K0325 | Knowledge of Information Theory (e.g., source coding, channel coding, algorithm complexity theory, and data compression). | 6 |
K0420 | Knowledge of database theory. | 4 |
Skills 26
Code | Description | Work Roles |
---|---|---|
S0013 | Skill in conducting queries and developing algorithms to analyze data structures. | 2 |
S0017 | Skill in creating and utilizing mathematical or statistical models. | 3 |
S0202 | Skill in data mining techniques (e.g., searching file systems) and analysis. | 2 |
S0028 | Skill in developing data dictionaries. | 1 |
S0029 | Skill in developing data models. | 1 |
S0037 | Skill in generating queries and reports. | 2 |
S0060 | Skill in writing code in a currently supported programming language (e.g., Java, C++). | 7 |
S0088 | Skill in using binary analysis tools (e.g., Hexedit, command code xxd, hexdump). | 3 |
S0089 | Skill in one-way hash functions (e.g., Secure Hash Algorithm [SHA], Message Digest Algorithm [MD5]). | 3 |
S0094 | Skill in reading Hexadecimal data. | 1 |
S0095 | Skill in identifying common encoding techniques (e.g., Exclusive Disjunction [XOR], American Standard Code for Information Interchange [ASCII], Unicode, Base64, Uuencode, Uniform Resource Locator [URL] encode). | 1 |
S0103 | Skill in assessing the predictive power and subsequent generalizability of a model. | 1 |
S0106 | Skill in data pre-processing (e.g., imputation, dimensionality reduction, normalization, transformation, extraction, filtering, smoothing). | 1 |
S0109 | Skill in identifying hidden patterns or relationships. | 1 |
S0113 | Skill in performing format conversions to create a standard representation of the data. | 1 |
S0114 | Skill in performing sensitivity analysis. | 1 |
S0118 | Skill in developing machine understandable semantic ontologies. | 1 |
S0119 | Skill in Regression Analysis (e.g., Hierarchical Stepwise, Generalized Linear Model, Ordinary Least Squares, Tree-Based Methods, Logistic). | 1 |
S0123 | Skill in transformation analytics (e.g., aggregation, enrichment, processing). | 1 |
S0125 | Skill in using basic descriptive statistics and techniques (e.g., normality, model distribution, scatter plots). | 1 |
S0126 | Skill in using data analysis tools (e.g., Excel, STATA SAS, SPSS). | 1 |
S0127 | Skill in using data mapping tools. | 1 |
S0129 | Skill in using outlier identification and removal techniques. | 1 |
S0130 | Skill in writing scripts using R, Python, PIG, HIVE, SQL, etc. | 1 |
S0160 | Skill in the use of design modeling (e.g., unified modeling language). | 3 |
S0369 | Skill to identify sources, characteristics, and uses of the organization’s data assets. | 1 |
Abilities 5
Code | Description | Work Roles |
---|---|---|
A0029 | Ability to build complex data structures and high-level programming languages. | 1 |
A0035 | Ability to dissect a problem and examine the interrelationships between data that may appear unrelated. | 2 |
A0036 | Ability to identify basic common coding flaws at a high level. | 2 |
A0041 | Ability to use data visualization tools (e.g., Flare, HighCharts, AmCharts, D3.js, Processing, Google Visualization API, Tableau, Raphael.js). | 1 |
A0066 | Ability to accurately and completely source all data used in intelligence, assessment and/or planning products. | 12 |
Tasks 23
Code | Description | Work Roles |
---|---|---|
T0342 | Analyze data sources to provide actionable recommendations. | 1 |
T0347 | Assess the validity of source data and subsequent findings. | 1 |
T0349 | Collect metrics and trending data. | 1 |
T0351 | Conduct hypothesis testing using statistical processes. | 1 |
T0353 | Confer with systems analysts, engineers, programmers, and others to design application. | 1 |
T0361 | Develop and facilitate data-gathering methods. | 1 |
T0366 | Develop strategic insights from large data sets. | 1 |
T0381 | Present technical information to technical and nontechnical audiences. | 3 |
T0382 | Present data in creative formats. | 2 |
T0383 | Program custom algorithms. | 1 |
T0385 | Provide actionable recommendations to critical stakeholders based on data analysis and findings. | 1 |
T0392 | Utilize technical documentation or resources to implement a new mathematical, data science, or computer science method. | 1 |
T0402 | Effectively allocate storage capacity in the design of data management systems. | 1 |
T0403 | Read, interpret, write, modify, and execute simple scripts (e.g., Perl, VBScript) on Windows and UNIX systems (e.g., those that perform tasks such as: parsing large data files, automating manual tasks, and fetching/processing remote data). | 2 |
T0404 | Utilize different programming languages to write code, open files, read files, and write output to different files. | 1 |
T0405 | Utilize open source language such as R and apply quantitative techniques (e.g., descriptive and inferential statistics, sampling, experimental design, parametric and non-parametric tests of difference, ordinary least squares regression, general line). | 1 |
T0460 | Develop and implement data mining and data warehousing programs. | 1 |
T0007 | Analyze and define data requirements and specifications. | 1 |
T0008 | Analyze and plan for anticipated changes in data capacity requirements. | 2 |
T0068 | Develop data standards, policies, and procedures. | 1 |
T0146 | Manage the compilation, cataloging, caching, distribution, and retrieval of data. | 2 |
T0195 | Provide a managed flow of relevant information (via web-based portals or other means) based on mission requirements. | 1 |
T0210 | Provide recommendations on new database technologies and architectures. | 2 |